MCPcopy
hub / github.com/vt-vl-lab/3d-photo-inpainting

github.com/vt-vl-lab/3d-photo-inpainting @main sqlite

repository ↗ · DeepWiki ↗
154 symbols 471 edges 10 files 27 documented · 18%
README

[CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting

Open 3DPhotoInpainting in Colab

[Paper] [Project Website] [Google Colab]

We propose a method for converting a single RGB-D input image into a 3D photo, i.e., a multi-layer representation for novel view synthesis that contains hallucinated color and depth structures in regions occluded in the original view. We use a Layered Depth Image with explicit pixel connectivity as underlying representation, and present a learning-based inpainting model that iteratively synthesizes new local color-and-depth content into the occluded region in a spatial context-aware manner. The resulting 3D photos can be efficiently rendered with motion parallax using standard graphics engines. We validate the effectiveness of our method on a wide range of challenging everyday scenes and show fewer artifacts when compared with the state-of-the-arts.

3D Photography using Context-aware Layered Depth Inpainting

Meng-Li Shih, Shih-Yang Su, Johannes Kopf, and Jia-Bin Huang

In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.

Prerequisites

  • Linux (tested on Ubuntu 18.04.4 LTS)
  • Anaconda
  • Python 3.7 (tested on 3.7.4)
  • PyTorch 1.4.0 (tested on 1.4.0 for execution)

and the Python dependencies listed in requirements.txt - To get started, please run the following commands: bash conda create -n 3DP python=3.7 anaconda conda activate 3DP pip install -r requirements.txt conda install pytorch==1.4.0 torchvision==0.5.0 cudatoolkit==10.1.243 -c pytorch - Next, please download the model weight using the following command: bash chmod +x download.sh ./download.sh

Quick start

Please follow the instructions in this section. This should allow to execute our results. For more detailed instructions, please refer to DOCUMENTATION.md.

Execute

  1. Put .jpg files (e.g., test.jpg) into the image folder.
    • E.g., image/moon.jpg
  2. Run the following command bash python main.py --config argument.yml
    • Note: The 3D photo generation process usually takes about 2-3 minutes depending on the available computing resources.
  3. The results are stored in the following directories:
    • Corresponding depth map estimated by MiDaS
      • E.g. depth/moon.npy, depth/moon.png
      • User could edit depth/moon.png manually.
        • Remember to set the following two flags as listed below if user wants to use manually edited depth/moon.png as input for 3D Photo.
          • depth_format: '.png'
          • require_midas: False
    • Inpainted 3D mesh (Optional: User need to switch on the flag save_ply)
      • E.g. mesh/moon.ply
    • Rendered videos with zoom-in motion
      • E.g. video/moon_zoom-in.mp4
    • Rendered videos with swing motion
      • E.g. video/moon_swing.mp4
    • Rendered videos with circle motion
      • E.g. video/moon_circle.mp4
    • Rendered videos with dolly zoom-in effect
      • E.g. video/moon_dolly-zoom-in.mp4
      • Note: We assume that the object of focus is located at the center of the image.
  4. (Optional) If you want to change the default configuration. Please read DOCUMENTATION.md and modified argument.yml.

License

This work is licensed under MIT License. See LICENSE for details.

If you find our code/models useful, please consider citing our paper:

@inproceedings{Shih3DP20,
  author = {Shih, Meng-Li and Su, Shih-Yang and Kopf, Johannes and Huang, Jia-Bin},
  title = {3D Photography using Context-aware Layered Depth Inpainting},
  booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2020}
}

Acknowledgments

Core symbols most depended-on inside this repo

cat
called by 27
networks.py
out_fmt
called by 13
mesh.py
spectral_norm
called by 12
networks.py
forward_3P
called by 8
networks.py
extrapolate
called by 8
mesh_tools.py
update_info
called by 8
mesh_tools.py
group_edges
called by 8
mesh.py
upsample
called by 7
networks.py

Shape

Function 100
Method 41
Class 13

Languages

Python100%

Modules by API surface

networks.py36 symbols
mesh.py33 symbols
mesh_tools.py30 symbols
utils.py27 symbols
MiDaS/monodepth_net.py13 symbols
MiDaS/MiDaS_utils.py6 symbols
bilateral_filtering.py5 symbols
boostmonodepth_utils.py3 symbols
MiDaS/run.py1 symbols

Dependencies from manifests, versioned

moviepy1.0.2 · 1×
networkx2.3 · 1×
opencv-python4.2.0.32 · 1×
transforms3d0.3.1 · 1×
vispy0.6.4 · 1×

For agents

$ claude mcp add 3d-photo-inpainting \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact