A professional multimodal dataset consisting of game-themed image-text pairs.
Volume: 500,000 pairs
Format: JPG images + TXT annotation files
Resolution: 1024P and above
Quality: No watermarks, no text overlay, high definition
Content: Game effects, characters, scenes
Safety: Compliant, free from inappropriate and sensitive content