LewallenAE/README.md

Anthony Lewallen

AI Research Engineer — Python • Systems • EvalOps/RLHF Tooling



What I Do

I build production-grade AI evaluation and data-quality tooling—the systems that make models and AI products reliable:

  • Dataset validation + anomaly detection
  • Evaluation harnesses + repeatable metrics
  • APIs/CLIs + storage + CI gates
  • Performance-minded pipelines (batching, streaming, profiling)

Open Source Contributions

Project | Description | Link
NLWeb (Microsoft) | Implemented CI/CD pipeline: linting, testing matrix, secrets scanning, Dependabot | PR #397

Projects

RLHF Data Quality System

A signal-based toolkit for auditing preference datasets (duplicates, formatting issues, refusal bias, readability), designed for real pipelines.

  • Extensible signals framework
  • SQLite → PostgreSQL storage path
  • CLI + API surface
  • CI-friendly: blocks bad data before training

github.com/LewallenAE/rlhf-eval
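
As a rough illustration of the signal-based auditing idea described above, here is a minimal Python sketch. It is not taken from the rlhf-eval codebase: the names `PreferencePair`, `Signal`, `audit`, and `gate`, the refusal marker list, and the flagged-ratio threshold are all hypothetical and chosen only for this example. Each signal is a small function that inspects one preference pair, and a gate turns the findings into an exit code that a CI job could use to block bad data before training.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass(frozen=True)
class PreferencePair:
    """One preference example (hypothetical schema for this sketch)."""
    prompt: str
    chosen: str
    rejected: str


# A signal inspects one pair and returns an issue message, or None if clean.
Signal = Callable[[PreferencePair], Optional[str]]

# Assumed, deliberately tiny marker list; a real detector would need more care.
REFUSAL_MARKERS = ("i can't", "i cannot", "i'm unable")


def identical_responses(pair: PreferencePair) -> Optional[str]:
    # A pair with the same chosen and rejected text carries no preference.
    if pair.chosen.strip() == pair.rejected.strip():
        return "chosen and rejected responses are identical"
    return None


def empty_field(pair: PreferencePair) -> Optional[str]:
    # Formatting check: every field must contain non-whitespace text.
    for name in ("prompt", "chosen", "rejected"):
        if not getattr(pair, name).strip():
            return f"{name} is empty"
    return None


def chosen_is_refusal(pair: PreferencePair) -> Optional[str]:
    # Crude refusal-bias check based on substring matching.
    text = pair.chosen.lower()
    if any(marker in text for marker in REFUSAL_MARKERS):
        return "chosen response looks like a refusal"
    return None


# New signals are added by registering another function here.
SIGNALS: Dict[str, Signal] = {
    "identical_responses": identical_responses,
    "empty_field": empty_field,
    "chosen_is_refusal": chosen_is_refusal,
}


def audit(pairs: List[PreferencePair],
          signals: Dict[str, Signal] = SIGNALS) -> Dict[int, List[str]]:
    """Run every signal on every pair; map pair index to its issues."""
    findings: Dict[int, List[str]] = {}
    for index, pair in enumerate(pairs):
        issues = []
        for name, check in signals.items():
            message = check(pair)
            if message is not None:
                issues.append(f"{name}: {message}")
        if issues:
            findings[index] = issues
    return findings


def gate(findings: Dict[int, List[str]], total: int,
         max_flagged_ratio: float = 0.0) -> int:
    """Return a process exit code: 0 to pass, 1 to fail the CI step."""
    ratio = len(findings) / total if total else 0.0
    return 0 if ratio <= max_flagged_ratio else 1
```

In a CI job, a script along these lines could load the dataset, call `audit`, print the findings, and pass the result of `gate` to `sys.exit`, so a non-zero exit stops the pipeline.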


Tech Stack

Category | Tools
Languages | Python, TypeScript/JavaScript, Java, SQL
ML/LLM | PyTorch, Hugging Face
Backend | FastAPI, REST, auth patterns
Data/Infra | SQLite, PostgreSQL, Docker, GitHub Actions

How I Build

  • Vertical slices over big rewrites
  • Interfaces + invariants first
  • Tests that prove behavior
  • Logging/metrics as first-class citizens

Open To

Applied Evals • EvalOps • Data Quality Engineering • Backend SWE • ML Systems


Email: anthonylewallen.dev@gmail.com

Popular repositories

  1. forage-midas (Public)

    Forked from vagabond-systems/forage-midas

    Project repo for the JPMC Advanced Software Engineering Forage program

    Java

  2. LewallenAE (Public)

  3. OctoSodales (Public)

    Python

  4. NLWeb (Public)

    Forked from nlweb-ai/NLWeb

    Main reference implementation for NLWeb, implemented in Python.

    Python

  5. rlhf-eval (Public)

    End-to-end RLHF data quality evaluation harness — detects preference pair pathologies, filters training data, and measures downstream reward model impact

    Python

  6. ground_zero_llm (Public)

    Python