Point Clouds Annotation
Project Structure Example
Main concepts
Point cloud Project
A point cloud project consists of one or more datasets of point clouds.
It also includes a sensor fusion feature that supports video camera sensors in the Labeling Tool UI.
Project Meta (meta.json)
Project Meta contains the essential information about the project - Classes and Tags. These are defined project-wide and can be used for labeling in every dataset inside the current project.
Datasets (<DATASET_NAME_1>, <DATASET_NAME_2>, ...)
Datasets are the second-level folders inside the project. They host subsets of point cloud scenes, related photo context (images), and annotations.
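The layout described above can be walked with a few lines of Python. This is a minimal sketch, assuming each dataset folder keeps its point clouds in a pointcloud subfolder (as described in the next section); the function name list_point_clouds is hypothetical and not part of any Supervisely API.

```python
from pathlib import Path


def list_point_clouds(project_dir):
    """Map each dataset folder name to the sorted .pcd file names it holds."""
    project = Path(project_dir)
    result = {}
    # Datasets are the second-level folders; files such as meta.json are skipped.
    for dataset in sorted(p for p in project.iterdir() if p.is_dir()):
        # Assumption: point clouds live in <dataset>/pointcloud.
        pcd_dir = dataset / "pointcloud"
        if pcd_dir.is_dir():
            result[dataset.name] = sorted(f.name for f in pcd_dir.glob("*.pcd"))
    return result
```

Calling list_point_clouds on a project folder returns a dictionary keyed by dataset name, which is a quick way to check that every .pcd file sits where the format expects it.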
Items/Point clouds (pointcloud)
Every .pcd file in a sequence has to be stored inside the pointcloud folder of its dataset.
Key | Value |
---|---|
x | The x coordinate of the point. |
y | The y coordinate of the point. |
z | The z coordinate of the point. |
r | The red color channel component. An 8-bit value (0-255). |
g | The green color channel component. An 8-bit value (0-255). |
b | The blue color channel component. An 8-bit value (0-255). |
All positional coordinates (x, y, z) are in meters. Supervisely supports all PCD encodings: ASCII, binary, and binary_compressed.
The PCD file format description can be found here
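To make the per-point fields concrete, here is a minimal sketch of a reader for the ASCII encoding only. It assumes every field listed in FIELDS has COUNT 1 and holds a number, and it does not cover the binary or binary_compressed encodings or any particular color layout. The function name parse_ascii_pcd and the sample data are hypothetical.

```python
def parse_ascii_pcd(text):
    """Parse an ASCII-encoded PCD document into (header, points)."""
    header = {}
    points = []
    lines = iter(text.splitlines())
    # Read header entries until the DATA line, which ends the header.
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition(" ")
        header[key] = value.split()
        if key == "DATA":
            break
    if header.get("DATA") != ["ascii"]:
        raise ValueError("only ASCII-encoded PCD data is handled in this sketch")
    fields = header["FIELDS"]
    # The remaining lines hold one point each, with values in FIELDS order.
    for line in lines:
        values = line.split()
        if values:
            points.append(dict(zip(fields, map(float, values))))
    return header, points


SAMPLE = """\
# .PCD v0.7 - Point Cloud Data file format
VERSION 0.7
FIELDS x y z
SIZE 4 4 4
TYPE F F F
COUNT 1 1 1
WIDTH 2
HEIGHT 1
VIEWPOINT 0 0 0 1 0 0 0
POINTS 2
DATA ascii
1.0 2.0 0.5
-3.5 0.0 1.25
"""

header, points = parse_ascii_pcd(SAMPLE)
```

Each entry of points is a dictionary keyed by field name, so the x, y, z values (in meters) can be read directly.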
Items Annotations (ann)
Point cloud annotations refer to individual point clouds and contain information about the labels on the point clouds in the datasets.
A dataset has a list of objects that can be shared between several point clouds.
The list of objects is defined for the entire dataset, even if an object's figure occurs in only one point cloud.
Figures represent individual labels, each attached to a single frame and its object.
Optional fields and loading
These fields are optional and are not needed when loading the project. The server can automatically fill in these fields while the project is loading.
id - unique identifier of the current object
classId - unique class identifier of the current object
labelerLogin - string - the name of the user who created the current figure
createdAt - string - date and time of figure creation
updatedAt - string - date and time of the last figure update
The main idea behind the key and id fields is explained below in the Key id map file section.
Fields definitions:
description - string - (optional) - this field is used to store the text to assign to the sequence.
key - string - unique key for a given sequence (used in key_id_map.json to get the sequence ID)
tags - list of strings that will be interpreted as point cloud tags
objects - list of objects that may be present on the dataset
geometryType - "cuboid_3d" or other 3D geometry - class shape
Fields definitions for the objects field:
key - string - unique key for a given object (used in key_id_map.json)
classTitle - string - the title of a class. It's used to identify the class shape from the meta.json file
tags - list of strings that will be interpreted as object tags (can be empty)
Fields description for the figures field:
key - string - unique key for a given figure (used in key_id_map.json)
objectKey - string - unique key to link figure to object (used in key_id_map.json)
geometryType - "cuboid_3d" or other 3D geometry - class shape
geometry - geometry of the object
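The field lists above can be assembled into an annotation like this. It is a minimal sketch, assuming that objects and figures are top-level lists of the annotation and that the optional server-filled fields are left out. The helper names new_key and make_annotation, the class title "Car", and the example geometry values are hypothetical.

```python
import json
import uuid


def new_key():
    """Return a unique hexadecimal key string."""
    return uuid.uuid4().hex


def make_annotation(class_title, geometry, description=""):
    """Build an annotation with one object and one figure linked to it."""
    object_key = new_key()
    return {
        "description": description,
        "key": new_key(),
        "tags": [],
        "objects": [
            {"key": object_key, "classTitle": class_title, "tags": []},
        ],
        "figures": [
            {
                "key": new_key(),
                # objectKey links the figure back to its object.
                "objectKey": object_key,
                "geometryType": "cuboid_3d",
                "geometry": geometry,
            },
        ],
    }


# Assumed geometry layout; see the geometry field description.
example_geometry = {
    "position": {"x": 1.0, "y": 2.0, "z": 0.5},
    "dimensions": {"x": 2.0, "y": 4.5, "z": 1.6},
    "rotation": {"x": 0.0, "y": 0.0, "z": 0.0},
}
annotation = make_annotation("Car", example_geometry)
annotation_json = json.dumps(annotation, indent=4)
```

The figure's objectKey equals the object's key, which is how a label on one point cloud stays tied to an object shared across the dataset.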
Description for the geometry field (cuboid_3d):
position - 3D vector of box center coordinates:
  x - forward in the direction of the object
  y - left
  z - up
dimensions - 3D vector that scales a cuboid from its local center along x, y, z:
  x - width
  y - length
  z - height
rotation - 3D vector that rotates a cuboid along an axis in world space:
  x - pitch
  y - roll
  z - yaw (direction)
Rotation values are bounded within [-pi, pi].
When yaw = 0, the box direction is strictly +y.
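A cuboid geometry that respects the rotation bounds could be built as follows. This is a minimal sketch, assuming each 3D vector is stored as a dictionary with x, y and z keys; the helper names wrap_angle and make_cuboid_geometry are hypothetical.

```python
import math


def wrap_angle(angle):
    """Wrap an angle in radians into the range [-pi, pi]."""
    return math.atan2(math.sin(angle), math.cos(angle))


def make_cuboid_geometry(center, size, yaw, pitch=0.0, roll=0.0):
    """Build a cuboid_3d geometry from center, (width, length, height) and angles."""
    cx, cy, cz = center
    width, length, height = size
    return {
        "position": {"x": cx, "y": cy, "z": cz},
        "dimensions": {"x": width, "y": length, "z": height},
        # Per the field description: x is pitch, y is roll, z is yaw.
        "rotation": {
            "x": wrap_angle(pitch),
            "y": wrap_angle(roll),
            "z": wrap_angle(yaw),
        },
    }


# A yaw of 3*pi/2 is outside the allowed range and wraps to -pi/2.
geometry = make_cuboid_geometry((1.0, 2.0, 0.5), (2.0, 4.5, 1.6), yaw=3 * math.pi / 2)
```

Wrapping through atan2 keeps the orientation unchanged while bringing every angle back inside the documented bounds.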
Key id map file
The basic idea behind the key-id map is that it maps the unique identifiers of entities in Supervisely to local entity keys. It is needed for local data manipulations such as cloning entities and reassigning relations between them. Examples of entities in key_id_map.json: datasets (videos), tags, objects, figures.
objects - dictionary where each key is a unique string, generated inside the Supervisely environment to map the current object in the annotation, and each value is the unique integer ID of that object
figures - dictionary where each key is a unique string, generated inside the Supervisely environment to map an object on the current frame in the annotation, and each value is the unique integer ID related to the current frame
videos - dictionary where each key is a unique string, generated inside the Supervisely environment to map a dataset in the annotation, and each value is the unique integer ID related to the current sequence
tags - dictionary where each key is a unique string, generated inside the Supervisely environment to map a tag on the current frame in the annotation, and each value is the unique integer ID related to the current tag
Key - a unique string generated by the python3 function uuid.uuid4().hex. All key values and IDs should be unique inside a single project and cannot be shared between frames/sequences.
Value - the integer identifier returned by the server while uploading an object / figure / sequence / tag
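The key-to-ID mapping can be illustrated with a short sketch. The keys come from uuid.uuid4().hex as stated above; the integer IDs 101 and 201 are hypothetical stand-ins for values the server would return on upload, and the helper names new_key and key_for_id are also hypothetical.

```python
import uuid


def new_key():
    """Return a 32-character hexadecimal key from uuid.uuid4().hex."""
    return uuid.uuid4().hex


# One dictionary per entity type, each mapping local key -> server integer ID.
key_id_map = {"tags": {}, "objects": {}, "figures": {}, "videos": {}}

object_key = new_key()
figure_key = new_key()
# Hypothetical IDs standing in for the integers the server returns on upload.
key_id_map["objects"][object_key] = 101
key_id_map["figures"][figure_key] = 201


def key_for_id(section, entity_id):
    """Find the local key mapped to a server ID, or None if it is absent."""
    for key, value in key_id_map[section].items():
        if value == entity_id:
            return key
    return None
```

Looking up key_for_id("objects", 101) returns the local key again, which is the reverse lookup needed when relations between cloned entities are reassigned.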
Format of frame_pointcloud_map.json
This file stores mapping between point cloud files and annotation frames in the correct order.
Keys - frame order number
Values - point cloud name (with extension)
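A mapping of this shape can be generated from an ordered list of file names. This is a minimal sketch, assuming frame numbering starts at 0; the keys are strings because JSON object keys are always strings. The function name build_frame_pointcloud_map and the file names are hypothetical.

```python
import json


def build_frame_pointcloud_map(pointcloud_names):
    """Map frame order numbers (as strings) to point cloud file names."""
    # Assumption: frames are numbered from 0 in the order given.
    return {str(index): name for index, name in enumerate(pointcloud_names)}


frame_map = build_frame_pointcloud_map(["frame_000.pcd", "frame_001.pcd"])
frame_map_json = json.dumps(frame_map, indent=4)
```

The order of the input list defines the frame order, so the list should be sorted the way the sequence is meant to play back.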
Photo context image annotation file
Fields description:
name - string - Name of image file
entityId (OPTIONAL) - integer >= 1 - ID of the point cloud in the system that the photo is attached to. Not required when uploading.
deviceId - string - Device ID or name.
timestamp - (OPTIONAL) - string - Time when the frame occurred in ISO 8601 format
sensorsData - Sensors data such as Pinhole camera model parameters. See wiki: Pinhole camera model and OpenCV docs for 3D reconstruction.
intrinsicMatrix - array of numbers - 3x3 flattened matrix (with the last zeros column dropped) of intrinsic parameters in row-major order, also called the camera matrix. It is used to denote camera calibration parameters. See Intrinsic parameters.
extrinsicMatrix - array of numbers - 4x3 flattened matrix (with the last zeros column dropped) of extrinsic parameters in row-major order, also called the joint rotation-translation matrix. It is used to denote the coordinate system transformation from 3D world coordinates to 3D camera coordinates. See Extrinsic_parameters.
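The row-major flattening of the two matrices can be sketched as below. Several points are assumptions rather than confirmed format details: that intrinsicMatrix and extrinsicMatrix are nested under sensorsData, and that the 12 extrinsic values are the three rows of the joint rotation-translation matrix [R | t]. The helper names and the file and device names are hypothetical.

```python
def flatten_row_major(matrix):
    """Flatten a list of rows into a single list, row after row."""
    return [float(value) for row in matrix for value in row]


def make_photo_context(name, device_id, fx, fy, cx, cy, rotation, translation):
    """Build a photo context annotation from pinhole camera parameters."""
    # 3x3 camera matrix built from focal lengths and principal point.
    intrinsic = [
        [fx, 0.0, cx],
        [0.0, fy, cy],
        [0.0, 0.0, 1.0],
    ]
    # Assumption: the extrinsic values are the rows of [R | t] (12 numbers).
    extrinsic = [list(row) + [t] for row, t in zip(rotation, translation)]
    return {
        "name": name,
        "deviceId": device_id,
        # Assumption: both matrices are nested under sensorsData.
        "sensorsData": {
            "intrinsicMatrix": flatten_row_major(intrinsic),
            "extrinsicMatrix": flatten_row_major(extrinsic),
        },
    }


identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
photo = make_photo_context(
    "frame_000.png", "camera_front", 700.0, 710.0, 320.0, 240.0,
    identity, [0.1, 0.2, 0.3],
)
```

With row-major order, the first three intrinsic values are fx, 0, cx, and every fourth extrinsic value is a translation component.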
Related apps
1. Import Point Cloud Project app.
2. Export pointclouds project in Supervisely format app.
Example projects
2. Demo pointcloud project with labels