Skip to main content
. 2025 Jul 1;12:1075. doi: 10.1038/s41597-025-05434-6

Table 1.

Metadata about the Dongba1800 dataset.

Metadata Item Description
Dataset Name Dongba1800: Single-Character Detection in Dongba Manuscripts Dataset
Dataset Description The dataset contains 1,800 images, including 111,702 characters, written by various Dongba people.
Data Resource Harvard-Yenching Library
Collection Time From July 12, 2024, to August 13, 2024
Data Format The data is stored in image format with resolutions ranging from 1200 × 416 to 1201 × 530 pixels.
Metadata Recording Each image file is accompanied by a TXT metadata file containing the character’s position coordinates.
Cultural Sensitivity The dataset includes significant cultural and linguistic values. When using the data, respect for the uniqueness of Naxi culture and language is required. Avoid misunderstanding and misrepresentation of Naxi culture and language.
Data Sharing and Collaboration Researchers are encouraged to collaborate with the Naxi community and data providers to ensure the accuracy and social impact of research results.