

Symmetria
A Synthetic Dataset for Learning in Point Clouds
Project page of the Symmetria dataset.
Abstract
Unlike in image or text domains that benefit from an abundance of large-scale datasets, point cloud learning techniques frequently encounter limitations due to the scarcity of extensive datasets. To overcome this limitation, we present Symmetria, a formula-driven dataset that can be generated to any arbitrary size. By utilizing the concept of symmetry, we create shapes with known structure and high variability, enabling neural networks to effectively learn point cloud features. Our results demonstrate that this dataset is highly effective for point cloud self-supervised pre-training, yielding models with strong performance in downstream tasks such as classification and segmentation, which also show good few-shot learning capabilities. Additionally, our dataset can be effectively used to fine-tune models to classify real-world objects, highlighting the practical utility and application of our approach. We also introduce a challenging task for symmetry detection and provide a benchmark for baseline comparisons. A significant advantage of our approach is the public availability of the dataset, the accompanying code, and the ability to generate very large collections, promoting further research and innovation in point cloud learning.
Ablation Study
In this section, we present a comprehensive collection of ablation study graphs which were excluded from the main manuscript to ensure brevity and enhance readability. This supplemental material is intended for researchers seeking a rigorous examination of the ablation results, offering deeper insights into the comparative performance and distinctions between the Symmetria Easy and Symmetria Hard datasets. All the data generated during these runs and the scripts used to create these graphs are available here.