DigitalChild Documentation¶
Welcome to the complete documentation for DigitalChild (LittleRainbowRights), a Python pipeline for analyzing human rights documents with focus on child and LGBTQ+ digital protections.
Quick Links¶
-
Getting Started
Install and run your first pipeline
-
REST API
Access data programmatically via 14 endpoints
-
Scorecard
Explore 10 indicators across 194 countries
-
Help & Support
Common questions and troubleshooting
Documentation Sections¶
Getting Started¶
- Installation - Set up DigitalChild on your system
- Quick Start - Run your first pipeline in 5 minutes
- First Run Errors - Troubleshooting guide
Core Guides¶
- Runbook - Complete pipeline operations guide
- Scorecard Workflow - Country indicator system
- Production Deployment - Deploy the API in production
API Documentation¶
- API Overview - REST API features and quickstart
- API Reference - All 14 endpoints with examples
- API Quick Reference - Endpoint cheat sheet
Scorecard¶
- Scorecard Overview - What it tracks and why
- Design & Methodology - How indicators are defined
- Data Access - API, CSV, and file access
- Visualization - Charts and exports
- Data Explorer - Interactive exploration
Standards & Specifications¶
- Metadata Schema - Document metadata structure
- Tags Config Format - Tagging system format
- Scraper Structure - How to build scrapers
- File Naming Standards - Naming conventions
Technical Architecture¶
- Architecture - System design overview
- Roadmap - Development phases and progress
Project Information¶
- FAQ - Frequently asked questions
- Contributing - How to contribute
- License - MIT License
Project Structure¶
DigitalChild/
├── pipeline_runner.py # Main entry point
├── scrapers/ # Web scrapers for document sources
├── processors/ # Text extraction and tagging
├── api/ # Flask REST API (Phase 4)
├── data/
│ ├── raw/ # Downloaded documents
│ ├── processed/ # Extracted text
│ ├── metadata/ # Document metadata with tags
│ └── exports/ # CSV exports for analysis
├── configs/ # Tag configurations and URL dictionaries
├── docs/ # This documentation
└── tests/ # Test suite (209 tests)
Key Features¶
- Document Pipeline: Scrape → Process → Tag → Enrich → Export
- REST API: 14 production endpoints with authentication and rate limiting
- Scorecard System: 10 indicators × 194 countries for digital rights analysis
- Flexible Tagging: Regex-based tagging with version control
- Data Quality: Automated validation of 2,543 source URLs
- Open Source: MIT license for code, CC BY 4.0 for data
Support¶
- Issues & Bugs: GitHub Issues
- Discussions: GitHub Discussions
- Questions: FAQ
- Website: grimdata.org
Citation¶
@software{littlerainbowrights2025,
title = {DigitalChild / LittleRainbowRights: Child and LGBTQ+ Digital Rights Analysis Pipeline},
author = {Vollmer, D.T. and Vollmer, S.C.},
year = {2025},
version = {2.0.0},
url = {https://github.com/MissCrispenCakes/DigitalChild},
doi = {10.5281/zenodo.18318098},
license = {MIT}
}
Version: 2.0.0 Last Updated: January 2026 License: MIT (code) / CC BY 4.0 (data)