Datamaestro Text API
This section documents the data types (schemas) used to represent datasets. These classes define the structure of datasets and provide methods to access their contents.
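As a rough illustration of what "a data type that defines a dataset's structure and provides access methods" can mean, here is a minimal sketch using only the Python standard library. The class `TextFolderSketch` and its methods `files` and `texts` are hypothetical names chosen for this example; they are not part of the datamaestro API, whose actual classes are documented on the pages listed below.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Iterator


@dataclass
class TextFolderSketch:
    """Hypothetical schema for a folder of plain-text files.

    Illustrative only: this is NOT a datamaestro class.
    """

    path: Path              # structure: where the dataset lives on disk
    pattern: str = "*.txt"  # structure: which files belong to the dataset

    def files(self) -> Iterator[Path]:
        """Yield the matching files in sorted order."""
        yield from sorted(self.path.glob(self.pattern))

    def texts(self) -> Iterator[str]:
        """Yield the content of each matching file as a string."""
        for file in self.files():
            yield file.read_text(encoding="utf-8")
```

The fields describe the dataset (location and file pattern), while the methods expose its contents lazily, so a caller can iterate over a large folder without loading every file at once.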
Core Domains:
Information Retrieval API - Documents, topics, assessments, and training triplets
Conversation API - Conversational IR and query reformulation
Text API - Raw text files and folders
Word Embeddings - Word embedding datasets
Recommendation - Rating and recommendation datasets
NLP - NLP annotations (CoNLL-U format)
Grand Debat API - Contributions to the French “Grand Débat”