[Combined PR] Security Hardening, Standardization, and Robustness Improvements #507

Ashutosh0x · 2026-01-13T07:49:11Z

Summary of Improvements

This PR consolidates several critical engineering improvements for the Gemma repository:

1. Security Hardening for Calculator (#469)

Safe Evaluation: Replaced unsafe eval() with a strict AST-based evaluator (_SafeEvaluator).
Whitelisting: Strictly permitted mathematical operations, constants (pi, e), and 10+ core functions.
Precision: Implemented standardized float formatting and scientific notation handling.

2. Architectural Standardization

Terminology Alignment:
- Renamed num_embed -> vocab_size (25+ files) for consistency with industry standards.
- Renamed attention_types -> layers_types to support non-attention layer types (e.g., identity).
Python Compatibility: Updated legacy match statements to if/elif blocks for Python 3.8/3.9 compatibility.

3. JAX Performance & Readability

Performance: Refactored core transformer token extraction to use jnp.take_along_axis in _transformer.py and gemma3n/_transformer.py, following maintainer TODO recommendations.

4. Data Pipeline Robustness (#504)

Resilience: Hardened _decode_bytes in _tasks.py with errors='replace' to prevent crashes on invalid UTF-8 sequences.
Testing: Added permanent unit tests in gemma/gm/data/_tasks_test.py.

5. Quality Assurance

Cleanup: Fixed multiple typos in examples and internal docstrings (Issue Typo in multiple files #423).
Maintenance: Removed stale TODO comments after verifying feature completion.

Verified through exhaustive unit tests, architectural audits, and compilation checks.

This commit addresses: 1. Security Fix (gemma google-deepmind#469): Replaces unsafe eval() with AST-based _SafeEvaluator in Calculator tool. 2. Architecture: Renames num_embed to vocab_size across the codebase for consistency. 3. Compatibility: Fixes legacy math statement SyntaxErrors for Python 3.8/3.9. 4. Cleanup: Removes stale nucleus sampling TODO.

… tests - Uses errors='replace' in _decode_bytes to prevent UnicodeDecodeError. - Adds gemma/gm/data/_tasks_test.py for permanent verification.

…nology - Fixes multiple typos in classification example and transformer comments (google-deepmind#423). - Refactors last token slicing to use jnp.take_along_axis in core Transformer. - Renames attention_types to layers_types repository-wide for architectural consistency.

….8 compatibility

Ashutosh0x added 3 commits January 13, 2026 13:17

Fix _decode_bytes robustness (gemma google-deepmind#504) and add unit…

ceaebce

… tests - Uses errors='replace' in _decode_bytes to prevent UnicodeDecodeError. - Adds gemma/gm/data/_tasks_test.py for permanent verification.

Ashutosh0x changed the title ~~Harden Calculator security (#469) and standardize transformer terminology~~ [Combined PR] Security Hardening, Standardization, and Robustness Improvements Jan 13, 2026

Ashutosh0x added 3 commits January 13, 2026 13:37

Add PR_DESCRIPTION.md summarizing consolidated improvements

ce076b6

Refactor ToolSampler to use proper ToolTurn for conversation history

19ced49

feat: Refine ToolSampler with proper ToolTurn formatting and Python 3…

83aa04d

….8 compatibility

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Combined PR] Security Hardening, Standardization, and Robustness Improvements #507

[Combined PR] Security Hardening, Standardization, and Robustness Improvements #507

Uh oh!

Ashutosh0x commented Jan 13, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

[Combined PR] Security Hardening, Standardization, and Robustness Improvements #507

Are you sure you want to change the base?

[Combined PR] Security Hardening, Standardization, and Robustness Improvements #507

Uh oh!

Conversation

Ashutosh0x commented Jan 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary of Improvements

1. Security Hardening for Calculator (#469)

2. Architectural Standardization

3. JAX Performance & Readability

4. Data Pipeline Robustness (#504)

5. Quality Assurance

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Ashutosh0x commented Jan 13, 2026 •

edited

Loading