We standardize feature dimension, sample distribution and semantic labeling, provide paired modal data and fusion annotation resources, suitable for underlying algorithm training of multimodal models. The dataset covers general scenes, industry professionalism and game creative content, with stable feature distribution and high sample diversity. Strict screening and deduplication ensure no redundant interference data, and standardized labeling rules maintain feature correlation consistency.