Documentation
Getting Started
Extract Pages
Run the Python extraction script to convert the Voynich Manuscript PDF into individual high-res page images.
Initialize Database
Set up the SQLite database with page metadata, seed hypotheses, and prepare the analysis pipeline.
Start Exploring
Launch the web app to browse pages, record findings, track hypotheses, and run AI analyses.
cd processing && python extract_pages.py /path/to/manuscript.pdfpython init_db.pycd ../app && npm install && npm run devManuscript Analysis
AI Pipeline
Visual Analysis
Claude vision API analyzes page imagery — illustrations, layouts, glyph patterns.
Glyph Extraction
Segment and catalog individual glyphs from manuscript pages for frequency analysis.
Pattern Recognition
Statistical analysis of glyph sequences, word boundaries, and section correlations.
Technical Reference
REST API
Endpoints for pages, images, hypotheses, findings, and analyses with field-level PATCH.
Database Schema
SQLite (dev) / PostgreSQL (prod) with tables for pages, hypotheses, findings, analyses, annotations.
CLI Tools
Python scripts for PDF extraction, database initialization, and batch processing.
Architecture
Need help or want to contribute?
This project is an experimental AI-powered analysis of the Voynich Manuscript (MS 408, Beinecke Rare Book & Manuscript Library, Yale University).