Note
The images downloaded by the below steps are slightly different from the images we downloaded a couple of years back. So, evaluating the images rendered by the pre-trained models with the images downloaded using below steps gives poor scores. We are looking into the discrepancy. In the meantime, if you need to obtain the QA scores of our model on the RE10K dataset for the specified scenes, please re-train SimpleNeRF on the downloaded images.
-
Download the dataset metadata from here and place it in
data/databases/RealEstate10K/downloaded_data/RealEstate10K.tar.gz -
Unzip the downloaded file
cd data/databases/RealEstate10K tar -xzvf downloaded_data/RealEstate10K.tar.gz -C unzipped_data/ cd ../../../
-
Obtain camera data of the five scenes used in ViP-NeRF
cd src/database_utils/real_estate/data_organizers python VideoNameMapper.py -
Run the data extractor file. This requires youtube-dl and ffmpeg to be installed.
python DataExtractor01.py cd ..If youtube-dl is not able to extract uploader-id, reinstall youtube-dl as suggested here.
-
train/test configs are already provided in the repository. In case you want to create them again:
cd train_test_creators/ python TrainTestCreator01.py python VideoPoseCreator01_Original.py cd ..
-
Return to root directory
cd ../../../-
Download the
nerf_llff_data.zipfile from original release in google drive. Place the downloaded file atData/databases/NeRF_LLFF/data/all/nerf_llff_data.zip. -
Run the data extractor file:
cd src/database_utils/nerf_llff/data_organizers/ python DataExtractor01.py cd ..
-
train/test configs are already provided in the repository. In case you want to create them again:
cd train_test_creators/ python TrainTestCreator01_UniformSparseSampling.py python VideoPoseCreator01_Spiral.py cd ..
-
Return to root directory
cd ../../../
We use the Open CV convention: (x, -y, -z) world-to-camera format to store the camera poses.
The camera intrinsics and extrinsics are stored in the csv format after flattening them, i.e., if a scene contains 50 frames, intrinsics and extrinsics are stores as csv files with 50 rows each and 9 & 16 columns respectively.
The directory tree in the following shows an example.
Please refer to one of the data-loaders for more details.
Organize your custom dataset in accordance with the data-loader or write the data-loader file to load the data directly from your custom database format.
Example directory tree:
<DATABASE_NAME>
|--data
|--all
| |--database_data
| |--scene0001
| | |--rgb
| | | |--0000.png
| | | |--0001.png
| | | |-- ...
| | | |--0049.png
| | |--CameraExtrinsics.csv
| | |--CameraIntrinsics.csv
| |--scene0002
| | ...
|--train_test_setsOur code also requires a config file specify the train/validation/test images. Please look into train-test-creators and replicate a similar file for your custom dataset.