Fix white space handling while parsing LRC with word time tags #85

Maxr1998 · 2025-06-14T00:26:16Z

Correctly handle white space between word time tags
Support parsing time tags that surround each non-blank segment

Fixes #83.

andy840119

I have only reviewed the first and third commits so far.
Once the review feedback is resolved, I'll continue reviewing the rest. I need to make sure that those breaking changes are correct.

LrcParser/Parser/Lrc/Lines/LrcLyricParser.cs

LrcParser/Parser/Lrc/Utils/LrcStartTimeUtils.cs

LrcParser.Tests/Parser/Lrc/LrcParserTest.cs

LrcParser.Tests/Parser/Lrc/Lines/LrcLyricParserTest.cs

andy840119

It seems there are several changes within this PR:

Adjust the test cases.
Refactor and add more tests.
(breaking change) Deal with spacing within the lyric.
(breaking change) Adjust the index.
(breaking change, but cancelled) Do not decode the lyric if it does not start with timing info.

Since LRC has no strict format definition, it's hard for me to decide on so many changes within a single PR.
It would be better to split those changes into separate PRs.

LrcParser.Tests/Parser/Lrc/Lines/LrcLyricParserTest.cs

LrcParser/Parser/Lrc/Utils/LrcTimedTextUtils.cs

LrcParser/Parser/Lrc/Lines/LrcLyricParser.cs

andy840119

Overall LGTM.

I haven't deeply reviewed LrcParser/Parser/Lrc/Utils/LrcTimedTextUtils.cs, but I can give it a quick pass because the test cases cover enough.

andy840119 · 2025-06-22T15:08:56Z

LrcParser.Tests/Parser/Lrc/Utils/LrcTimedTextUtilsTest.cs

-    [TestCase("<00:51.00><01:29.99><01:48.29><02:31.00><02:41.99>You gotta fight !", "You gotta fight !", new[] { "[0,start]:51000" })] // decode with invalid format.
-    public void TestDecodeWithInvalidFormat(string text, string expectedText, string[] expectedTimeTags)
-    {
-        var (actualText, actualTimeTags) = LrcTimedTextUtils.TimedTextToObject(text);
+        var (actualText, actualTimeTags) = LrcTimedTextUtils.TimedTextToObject(text, lineStartTime);


I have no idea why you removed this test case.

Maxr1998 · Jun 22, 2025

That's the one I moved above, since the parsing itself passes fine. Thus, I don't consider it invalid.

andy840119 · Jun 22, 2025

I see.

The old question:
does the LRC format define this kind of case?

Maxr1998 · Jun 22, 2025

Not really 🥲
However, the approach I chose seems logical to me:
<00:51.00><01:29.99><01:48.29><02:31.00><02:41.99>You gotta fight !
There are four segments that have a time defined, but no content. Thus, they don't need to be extracted and will be skipped. The last segment isn't empty, and will have a start time of 02:41.99.
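
For illustration, the skipping rule described above can be sketched as follows. This is a minimal, hypothetical C# sketch and not the code in this PR: the class `TimedTextSketch`, its `Parse` method, and the assumed `<mm:ss.xx>` tag shape are made up for the example. It only tracks start times, so a trailing tag with no content after it is simply dropped.

```csharp
using System.Collections.Generic;
using System.Text;
using System.Text.RegularExpressions;

// Hypothetical sketch; NOT the actual LrcTimedTextUtils implementation.
public static class TimedTextSketch
{
    // Assumed tag shape: <mm:ss.xx> (minutes, seconds, hundredths of a second).
    private static readonly Regex TagRegex = new Regex(@"<(\d{2}):(\d{2})\.(\d{2})>");

    // Returns the plain text plus (text index, start time in ms) for each non-empty segment.
    public static (string Text, List<(int Index, int Milliseconds)> Tags) Parse(string timedText)
    {
        var text = new StringBuilder();
        var tags = new List<(int Index, int Milliseconds)>();
        int? pendingTime = null; // most recent tag time not yet attached to any content
        int position = 0;

        foreach (Match match in TagRegex.Matches(timedText))
        {
            // Content between the previous tag (or the line start) and this tag.
            string segment = timedText.Substring(position, match.Index - position);
            if (segment.Length > 0)
            {
                if (pendingTime.HasValue)
                    tags.Add((text.Length, pendingTime.Value));
                text.Append(segment);
            }

            // An empty segment is skipped: its time is overwritten by the next tag.
            pendingTime = ToMilliseconds(match);
            position = match.Index + match.Length;
        }

        // Trailing content after the last tag.
        string rest = timedText.Substring(position);
        if (rest.Length > 0)
        {
            if (pendingTime.HasValue)
                tags.Add((text.Length, pendingTime.Value));
            text.Append(rest);
        }

        return (text.ToString(), tags);
    }

    private static int ToMilliseconds(Match match)
    {
        int minutes = int.Parse(match.Groups[1].Value);
        int seconds = int.Parse(match.Groups[2].Value);
        int hundredths = int.Parse(match.Groups[3].Value);
        return minutes * 60_000 + seconds * 1_000 + hundredths * 10;
    }
}
```

Under these assumptions, calling `TimedTextSketch.Parse` on the line above yields the text `You gotta fight !` and a single tag at index 0 with a start time of 161990 ms (02:41.99), since the four empty segments contribute nothing.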

andy840119 · Jun 23, 2025

Hmm...
I think it should be OK for now, until someone complains later.

LrcParser.Tests/Parser/Lrc/Utils/LrcTimedTextUtilsTest.cs

LrcParser.Tests/Parser/Lrc/Lines/LrcLyricParserTest.cs

pull-request-size bot added the size/XL label Jun 14, 2025

Maxr1998 force-pushed the fix-lrc-parser branch from 3254166 to 03be7eb on June 18, 2025 12:50

pull-request-size bot added size/L and removed size/XL labels Jun 18, 2025

Maxr1998 marked this pull request as ready for review June 18, 2025 12:51

Maxr1998 force-pushed the fix-lrc-parser branch 2 times, most recently from c223e2c to 3ebeb81, on June 18, 2025 13:47

andy840119 reviewed Jun 18, 2025

Maxr1998 force-pushed the fix-lrc-parser branch from 551ab7a to c6aebda on June 21, 2025 14:05

andy840119 reviewed Jun 21, 2025

LrcParser.Tests/Parser/Lrc/Lines/LrcLyricParserTest.cs (outdated, resolved)

LrcParser/Parser/Lrc/Utils/LrcTimedTextUtils.cs (resolved)

LrcParser/Parser/Lrc/Lines/LrcLyricParser.cs (resolved)

Maxr1998 changed the title from "Rework LRC parsing with word time tags" to "Fix white space handling while parsing LRC with word time tags" on Jun 21, 2025

Maxr1998 mentioned this pull request Jun 21, 2025

Add helper to check whether a line starts with a line time tag #86

Merged

Maxr1998 force-pushed the fix-lrc-parser branch 2 times, most recently from 7a976d8 to c32a970, on June 21, 2025 17:40

andy840119 added the "Breaking change" label (label description: "This change might let parsing result not same as before.") on Jun 22, 2025

Maxr1998 mentioned this pull request Jun 22, 2025

Fix timestamp format in LrcLyricParserTest #87

Merged

Maxr1998 force-pushed the fix-lrc-parser branch 3 times, most recently from 94a1ac3 to 9e88207, on June 22, 2025 13:33

Maxr1998 mentioned this pull request Jun 22, 2025

Don't parse word time tags if multiple line times are present #88

Merged

Maxr1998 force-pushed the fix-lrc-parser branch from 9e88207 to c57f5e1 on June 22, 2025 14:40

andy840119 approved these changes Jun 22, 2025

Fix white space handling when parsing LRC with word time tags

81381f7

- Correctly handle white space between word time tags
- Support parsing time tags that surround each non-blank segment

Maxr1998 force-pushed the fix-lrc-parser branch from c57f5e1 to 81381f7 on June 22, 2025 20:17

Adjust the test case.

0c303d8

andy840119 merged commit ae8e5a1 into karaoke-dev:main Jun 23, 2025
3 checks passed
