
Welcome to ContextGem Documentation!
ContextGem is a free, open-source LLM framework that makes it radically easier to extract structured data and insights from documents — with minimal code.
📚 Project Description
Learn about the motivation, comparisons with other frameworks, and how ContextGem works.
🚀 Getting Started
Instructions to install ContextGem and quickly start using it.
🔍 Extracting Aspects
Learn how to identify and extract specific document sections like clauses, chapters, or terms using ContextGem's Aspects API.
💎 Extracting Concepts
Learn how to extract and infer structured data like JSON objects, strings, numbers, dates, booleans, and ratings from documents using ContextGem's Concepts API.
🤖 Large Language Models
Learn about supported cloud LLM providers and local models, and how to configure and use them for extraction.
🔄 Document Converters
Learn how to use ContextGem's built-in document converters for files such as DOCX.
🔧 Advanced Usage
Explore advanced features and techniques for extracting data from documents.
⚙️ Optimization Guide
Learn how to optimize your extraction pipeline for accuracy, cost, and performance.
💾 Serialization
Learn how to serialize and deserialize ContextGem objects for storage and transfer.
📖 API Reference
Complete API documentation for all ContextGem modules and classes.