DigitalChild Documentation¶

Welcome to the complete documentation for DigitalChild (LittleRainbowRights), a Python pipeline for analyzing human rights documents with focus on child and LGBTQ+ digital protections.

Quick Links¶

Getting Started

Install and run your first pipeline

Quick Start
REST API

Access data programmatically via 14 endpoints

API Documentation
Scorecard

Explore 10 indicators across 194 countries

Digital Rights Scorecard
Help & Support

Common questions and troubleshooting

FAQ

Documentation Sections¶

Getting Started¶

Installation - Set up DigitalChild on your system
Quick Start - Run your first pipeline in 5 minutes
First Run Errors - Troubleshooting guide

Core Guides¶

Runbook - Complete pipeline operations guide
Scorecard Workflow - Country indicator system
Production Deployment - Deploy the API in production

API Documentation¶

API Overview - REST API features and quickstart
API Reference - All 14 endpoints with examples
API Quick Reference - Endpoint cheat sheet

Scorecard¶

Scorecard Overview - What it tracks and why
Design & Methodology - How indicators are defined
Data Access - API, CSV, and file access
Visualization - Charts and exports
Data Explorer - Interactive exploration

Standards & Specifications¶

Metadata Schema - Document metadata structure
Tags Config Format - Tagging system format
Scraper Structure - How to build scrapers
File Naming Standards - Naming conventions

Technical Architecture¶

Architecture - System design overview
Roadmap - Development phases and progress

Project Information¶

FAQ - Frequently asked questions
Contributing - How to contribute
License - MIT License

Project Structure¶

DigitalChild/
├── pipeline_runner.py      # Main entry point
├── scrapers/               # Web scrapers for document sources
├── processors/             # Text extraction and tagging
├── api/                    # Flask REST API (Phase 4)
├── data/
│   ├── raw/               # Downloaded documents
│   ├── processed/         # Extracted text
│   ├── metadata/          # Document metadata with tags
│   └── exports/           # CSV exports for analysis
├── configs/               # Tag configurations and URL dictionaries
├── docs/                  # This documentation
└── tests/                 # Test suite (209 tests)

Key Features¶

Document Pipeline: Scrape → Process → Tag → Enrich → Export
REST API: 14 production endpoints with authentication and rate limiting
Scorecard System: 10 indicators × 194 countries for digital rights analysis
Flexible Tagging: Regex-based tagging with version control
Data Quality: Automated validation of 2,543 source URLs
Open Source: MIT license for code, CC BY 4.0 for data

Support¶

Issues & Bugs: GitHub Issues
Discussions: GitHub Discussions
Questions: FAQ
Website: grimdata.org

Citation¶

@software{littlerainbowrights2025,
  title = {DigitalChild / LittleRainbowRights: Child and LGBTQ+ Digital Rights Analysis Pipeline},
  author = {Vollmer, D.T. and Vollmer, S.C.},
  year = {2025},
  version = {2.0.0},
  url = {https://github.com/MissCrispenCakes/DigitalChild},
  doi = {10.5281/zenodo.18318098},
  license = {MIT}
}

Version: 2.0.0 Last Updated: January 2026 License: MIT (code) / CC BY 4.0 (data)