Extracting Databases

RealEstate-10K

Note

The images downloaded by the below steps are slightly different from the images we downloaded a couple of years back. So, evaluating the images rendered by the pre-trained models with the images downloaded using below steps gives poor scores. We are looking into the discrepancy. In the meantime, if you need to obtain the QA scores of our model on the RE10K dataset for the specified scenes, please re-train SimpleNeRF on the downloaded images.

Download the dataset metadata from here and place it in data/databases/RealEstate10K/downloaded_data/RealEstate10K.tar.gz

Unzip the downloaded file

cd data/databases/RealEstate10K
tar -xzvf downloaded_data/RealEstate10K.tar.gz -C unzipped_data/
cd ../../../

Obtain camera data of the five scenes used in ViP-NeRF

cd src/database_utils/real_estate/data_organizers
python VideoNameMapper.py

Run the data extractor file. This requires youtube-dl and ffmpeg to be installed.
```
python DataExtractor01.py
cd ..
```
If youtube-dl is not able to extract uploader-id, reinstall youtube-dl as suggested here.

train/test configs are already provided in the repository. In case you want to create them again:

cd train_test_creators/
python TrainTestCreator01.py
python VideoPoseCreator01_Original.py
cd ..

Return to root directory

cd ../../../

NeRF-LLFF

Download the nerf_llff_data.zip file from original release in google drive. Place the downloaded file at Data/databases/NeRF_LLFF/data/all/nerf_llff_data.zip.

Run the data extractor file:

cd src/database_utils/nerf_llff/data_organizers/
python DataExtractor01.py
cd ..

train/test configs are already provided in the repository. In case you want to create them again:

cd train_test_creators/
python TrainTestCreator01_UniformSparseSampling.py
python VideoPoseCreator01_Spiral.py
cd ..

Return to root directory
```
cd ../../../
```

Custom Databases

We use the Open CV convention: (x, -y, -z) world-to-camera format to store the camera poses. The camera intrinsics and extrinsics are stored in the csv format after flattening them, i.e., if a scene contains 50 frames, intrinsics and extrinsics are stores as csv files with 50 rows each and 9 & 16 columns respectively. The directory tree in the following shows an example. Please refer to one of the data-loaders for more details. Organize your custom dataset in accordance with the data-loader or write the data-loader file to load the data directly from your custom database format.

Example directory tree:

<DATABASE_NAME>
 |--data
    |--all
    |  |--database_data
    |     |--scene0001
    |     |  |--rgb
    |     |  |  |--0000.png
    |     |  |  |--0001.png
    |     |  |  |-- ...
    |     |  |  |--0049.png
    |     |  |--CameraExtrinsics.csv
    |     |  |--CameraIntrinsics.csv
    |     |--scene0002
    |     | ...
    |--train_test_sets

Our code also requires a config file specify the train/validation/test images. Please look into train-test-creators and replicate a similar file for your custom dataset.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extracting Databases

RealEstate-10K

NeRF-LLFF

Custom Databases

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Extracting Databases

RealEstate-10K

NeRF-LLFF

Custom Databases