Learning 3D Part Assembly from a Single Image

Last update: Dec 21, 2022

Related tags

Overview

Learning 3D Part Assembly from a Single Image

This repository contains a PyTorch implementation of the paper:

Learning 3D Part Assembly from A Single Image.
Yichen Li*, Kaichun Mo*, Lin Shao, Minhyuk Sung, Leonidas Guibas,
ECCV 2020

Introduction

Autonomous assembly is a crucial capability for robots in many applications. For this task, several problems such as obstacle avoidance, motion planning, and actuator control have been extensively studied in robotics. However, when it comes to task specification, the space of possibilities remains underexplored. Towards this end, we introduce a novel problem, single-image-guided 3D part assembly, along with a learningbased solution. We study this problem in the setting of furniture assembly from a given complete set of parts and a single image depicting the entire assembled object. Multiple challenges exist in this setting, including handling ambiguity among parts (e.g., slats in a chair back and leg stretchers) and 3D pose prediction for parts and part subassemblies, whether visible or occluded. We address these issues by proposing a two-module pipeline that leverages strong 2D-3D correspondences and assembly-oriented graph message-passing to infer part relationships. In experiments with a PartNet-based synthetic benchmark, we demonstrate the effectiveness of our framework as compared with three baseline approaches.

Dependencies

Python 3.6
CUDA 10.0.
PyTorch. code tested with version 1.3.1
Blender. for visualization of results 2.7.9
(Optional) Tensorboard for visualization of the training process.
For the project it has been used TensorboardX

pip install -r requirements.txt

Chamfer Distance

cd exps/utils/cd
python setup.py install

Dataset

Data is available here: link.

wget http://download.cs.stanford.edu/orion/impartass/assembly_data.zip

Training

Training the segmentation stage first

cd exps/exp_segmentation
sh train.sh

modify your parameters including data_path, exp_name and etc. (see closed issues for details info)

Training the assembly stage

cd exps/exp_assemble
sh train.sh

Pre-trained models

Pretrained weights for the chair category is available at link.

wget http://download.cs.stanford.edu/orion/impartass/chair_weights.zip

Cite

Please cite our work if you find it useful:

@article{li2020impartass,
    title={Learning 3D Part Assembly from a Single Image},
    author={Li, Yichen and Mo, Kaichun and Shao, Lin and Sung, Minghyuk and Guibas, Leonidas},
    journal={European conference on computer vision (ECCV 2020)},
    year={2020}
}

Learning 3D Part Assembly from a Single Image

Related tags

Overview

Learning 3D Part Assembly from a Single Image

Introduction

Dependencies

Dataset

Training

Training the segmentation stage first

Training the assembly stage

Pre-trained models

Cite

Owner

Lolviz - A simple Python data-structure visualization tool for lists of lists, lists, dictionaries; primarily for use in Jupyter notebooks / presentations

An e-commerce company wants to segment its customers and determine marketing strategies according to these segments.

PyTorch implementations of the paper: "DR.VIC: Decomposition and Reasoning for Video Individual Counting, CVPR, 2022"

NALSM: Neuron-Astrocyte Liquid State Machine

CUda Matrix Multiply library.

Unsupervised Learning of Video Representations using LSTMs

A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection

A curated list of awesome deep long-tailed learning resources.

Codes for AAAI22 paper "Learning to Solve Travelling Salesman Problem with Hardness-Adaptive Curriculum"

[CVPR'22] Official PyTorch Implementation of Collaborative Transformers for Grounded Situation Recognition

Tensorflow-seq2seq-tutorials - Dynamic seq2seq in TensorFlow, step by step

TEDSummary is a speech summary corpus. It includes TED talks subtitle (Document), Title-Detail (Summary), speaker name (Meta info), MP4 URL, and utterance id

A knowledge base construction engine for richly formatted data

Playable Video Generation

TransMorph: Transformer for Medical Image Registration

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing

JittorVis - Visual understanding of deep learning models

GPU-accelerated Image Processing library using OpenCL

The code used for the free [email protected] Webinar series on Reinforcement Learning in Finance

A series of Jupyter notebooks with Chinese comment that walk you through the fundamentals of Machine Learning and Deep Learning in python using Scikit-Learn and TensorFlow.