Awesome Text-to-3D Plus
A collection of recent methods on 3D generation from text description.
There are mainly 2 kinds of methods of text-to-3D generation:
- Direct End-to-End Generation
(There are multiple internal steps, but they are transparent to the user)
- initialize a coarse layout from text, and then refine/inpaint it
- generate a local scene from text, and then outpaint/optimize it
- Sequential Multi-Stage Generation
(Each internal step has an independent output as the input for the next stage)
- reconstruction based on text-to-image models and depth-estimation models
- reconstruct based on the multi-view generation models from text
- reconstruct a premitive scene from a text-to-image model, then gradually expand it and align features
This repo focuses on the Sequential Multi-Stage Generation approach, and the generation about 3D scene. As for the other topic, please refer to the comprehensive collections listed under Related-Repos-and-Websites at the end of this file. Feel free to submit a pull request if you have relevant papers to add.
Other repos:
About abbreviation: In the list below: B for best paper, S for spotlight, H for highlight, W for workshop.
History
Papers
2020
2021
2022
2023
2024
Scene generation from images
Scene generation from videos