Skip to content

bdhamilton/romero

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

71 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Romero Text Explorer

A text analysis tool for Oscar Romero's homilies (1977-1980). Search for any word or phrase in Spanish and see how its frequency changed over the three years of Romero's preaching.

Quick Start

# 1. Clone repository
git clone https://github.com/bdhamilton/romero.git
cd romero

# 2. Set up Python environment
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

# 3. Run the web app
python app.py
# Open http://localhost:5000

The text explorer is the main page. Browse all homilies at /browse (includes known data issues and links to PDFs on the Romero Trust site).

There's also a CLI tool for terminal use:

python ngram.py pueblo                         # raw count
python ngram.py "pueblo de dios" --norm words  # per 10k words
python ngram.py justicia --norm homilies       # per homily

To rebuild the database from scratch (downloads from Romero Trust, ~20 minutes). This backs up the existing database first:

python scripts/build_database.py

Data Source

All homilies sourced from The Romero Trust, which provides PDFs and audio recordings of Romero's preaching. When complete, this site will just act as a window back to the Romero Trust website--a different way of viewing their data, rather than a duplication of their data.

Coverage:

  • 197 homilies from March 14, 1977 (his first homily as Archbishop) to March 24, 1980 (day of assassination)
  • Spanish text extracted for 193 homilies (4 missing — audio-only or special events)
  • English text extracted for 194 homilies (3 missing)
  • Audio recordings for 172 homilies (~87%)