SceneVerse/preprocess at main · scene-verse/SceneVerse

History

Name		Name	Last commit message	Last commit date
parent directory ..
ssg		ssg
utils		utils
README.md		README.md
__init__.py		__init__.py
arkitscenes.py		arkitscenes.py
build.py		build.py
multiscan.py		multiscan.py
rscan.py		rscan.py
sceneverse2hmsemantic.py		sceneverse2hmsemantic.py

README.md

Data Processing

We have released the preprocessing scripts for 3RScan, MultiScan and ARKitScenes. They are designed to provide a comprehensive framework for data preparation. Taing the 3RScan as an example, the process involves the following steps:

Import raw meshes and annotations from each dataset.
Extract vertices from the mesh and assign both instance and semantic labels to them.
Map the dataset-specific semantic labels to ScanNet 607. This is optional for SceneVerse training but may be required for closed-vocab training (example).
Axis Alignment: Rotate the 3D point clouds so that most 3D object bounding boxes are axis-aligned. This follows ScanRefer, and is currently implemented as a heuristic search (example).
Translation Alignment: Translate the 3D point clouds so that its origin at the center on the floor (example).
Color Alignment: The color value should be within the [0, 255] range (example).

Point subsampling: subsample the point clouds if the number of points exceeds 240K.

PTS_LIMIT = 240000
if out_points.shape[0] > PTS_LIMIT:
    pcd_idxs = np.random.choice(out_points.shape[0], size=PTS_LIMIT, replace=False)
    out_points = out_points[pcd_idxs]
    out_colors = out_colors[pcd_idxs]
    instance_labels = instance_labels[pcd_idxs]

The detailed steps may vary between datasets. Please note the translation and color alignment are critical as they can significantly impact performance. Axis alignment, which requires 3D bounding box annotations, may result in slight but not severe degradation.

3RScan

To reproduce the data preprocessing, download 3RScan and run:

# Preprocess 3RScan 
$ python rscan.py

Adjust the data_root, save_root and num_workers accordingly.

HM3D

As some of our users requested the mapping between HM3D object id in SceneVerse to HM3D-semantics, we have added an additional file (HM3D_tgtID2objID.zip) to obtain this mapping. The json file for each scene contains a dictionary of {<sceneverse_objid>:[hm3d_objid, hm3d_label]}.

Note: The script sceneverse2hmsemantic.py has been deprecated as it cannot reproduce the mappings above. It currently points out how we read the semantics from the annotations in HM3D-semantics.

Prepare for your custom datasets

To prepare your custom data for inference, follow the previous steps and the example script for 3RScan. A convenient way for verification is to use the visualize_data.py. If everything is correct, you should observe the colored point clouds displayed similarly to those in the released version of SceneVerse.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

preprocess

preprocess

README.md

Data Processing

3RScan

HM3D

Prepare for your custom datasets

Files

preprocess

Directory actions

More options

Directory actions

More options

Latest commit

History

preprocess

Folders and files

parent directory

README.md

Data Processing

3RScan

HM3D

Prepare for your custom datasets