Handle utf8 strings properly by AzurIce · Pull Request #34 · redstrate/Physis

AzurIce · 2026-02-13T04:14:26Z

closes: #33

Previously all null-terminated string readers used byte as char, which treats each byte as a Latin-1 code point. This corrupts any multi-byte UTF-8 text (e.g. Chinese/Japanese item names in EXD sheets).

Replace every occurrence with proper UTF-8 decoding via String::from_utf8, and extract two reusable helpers in common_file_operations:

read_null_terminated_utf8 (reader-based)
null_terminated_utf8 (byte-slice-based)

Also fix dic.rs where as u8 as char truncated full Unicode code points to 8 bits.

Add 8 unit tests covering ASCII, CJK, empty, and invalid UTF-8 inputs.

Previously all null-terminated string readers used `byte as char`, which treats each byte as a Latin-1 code point. This corrupts any multi-byte UTF-8 text (e.g. Chinese/Japanese item names in EXD sheets). Replace every occurrence with proper UTF-8 decoding via `String::from_utf8`, and extract two reusable helpers in common_file_operations: - `read_null_terminated_utf8` (reader-based) - `null_terminated_utf8` (byte-slice-based) Also fix `dic.rs` where `as u8 as char` truncated full Unicode code points to 8 bits. Add 8 unit tests covering ASCII, CJK, empty, and invalid UTF-8 inputs.

redstrate

Thanks! Didn't think it would be that easy, also tested it locally with Novus and it prints the correct Japanese text.

AzurIce added 2 commits February 13, 2026 12:06

removed unused import

86dc0e9

redstrate approved these changes Feb 13, 2026

View reviewed changes

redstrate merged commit f668799 into redstrate:main Feb 13, 2026
1 of 2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handle utf8 strings properly#34

Handle utf8 strings properly#34
redstrate merged 2 commits intoredstrate:mainfrom
AzurIce:main

AzurIce commented Feb 13, 2026

Uh oh!

redstrate left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

AzurIce commented Feb 13, 2026

Uh oh!

redstrate left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants