ertas

    AI Data Formats

    Understand the formats used in AI training, inference, and deployment.

    Alpaca Format

    Conversation

    Instruction-following dataset format for LLM fine-tuning

    ChatML

    Conversation

    Chat Markup Language for structured LLM conversations

    COCO Format

    Annotation

    Microsoft COCO annotation format for object detection and segmentation

    CoNLL

    Annotation

    Column-based annotation format for NER and POS tagging

    CSV for ML Training

    Training Data

    Using CSV files for machine learning training data

    GGUF

    Model Weights

    Binary file format for quantized local LLM inference, originating in the llama.cpp ecosystem

    JSONL (JSON Lines)

    Training Data

    A widely used line-delimited format for LLM fine-tuning datasets

    ONNX

    Model Weights

    Open Neural Network Exchange format for cross-platform inference

    Parquet

    Training Data

    Columnar storage format for large-scale training datasets

    SafeTensors

    Model Weights

    Safe and fast model weight storage format by Hugging Face

    ShareGPT Format

    Conversation

    Multi-turn conversation format for chat model training

    YOLO Format

    Annotation

    Annotation format for YOLO object detection models
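
As a rough illustration of how two of the formats listed above can fit together, the sketch below writes instruction-following records in the Alpaca layout (`instruction`, `input`, `output` keys) as JSONL, one JSON object per line, and reads them back. The sample records and the helper names `to_jsonl` and `from_jsonl` are hypothetical and chosen only for this example. A given training tool may expect different keys or extra fields.

```python
import json

# Hypothetical sample records using the Alpaca keys: instruction, input, output.
records = [
    {"instruction": "Translate to French.", "input": "Good morning", "output": "Bonjour"},
    {"instruction": "Name a primary color.", "input": "", "output": "Red"},
]


def to_jsonl(rows):
    """Serialize each record as a single JSON object on its own line."""
    return "\n".join(json.dumps(row, ensure_ascii=False) for row in rows) + "\n"


def from_jsonl(text):
    """Parse JSONL text back into a list of dicts, skipping blank lines."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


jsonl_text = to_jsonl(records)
round_tripped = from_jsonl(jsonl_text)
```

Because each line is parsed independently, a file in this layout can be streamed or appended to without loading the whole dataset into memory.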

    © 2026 Ertas AI.
