Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions CHANGES.md
Original file line number Diff line number Diff line change
Expand Up @@ -12,6 +12,7 @@ Changes:

Fixes:
- Modifies rendering of AhocorasickTokenizer parameter in API docs II
- Removed star-pagination markers from extracted text #293

## Current

Expand Down
4 changes: 3 additions & 1 deletion eyecite/clean.py
Original file line number Diff line number Diff line change
Expand Up @@ -51,7 +51,9 @@ def html(html_content: str) -> str:
parent::link |
parent::head |
parent::page-number |
parent::script)]"""
parent::script |
parent::*[@class="star-pagination"]
)]"""
)
return " ".join(text)

Expand Down
9 changes: 8 additions & 1 deletion tests/test_FindTest.py
Original file line number Diff line number Diff line change
Expand Up @@ -828,7 +828,14 @@ def test_find_citations(self):
# Fix for index error when searching for case name
("<p>State v. Luna-Benitez (S53965). Alternative writ issued, dismissed, 342 Or 255</p>",
[case_citation(volume="342", reporter="Or", page="255")],
{'clean_steps': ['html', 'inline_whitespace']})
{'clean_steps': ['html', 'inline_whitespace']}),
# Test remove text with star-pagination class
("<p>The somewhat similar cases of <i>Crane</i> v. <i>Hyde Park,</i> 135 <span class=\"star-pagination\">*355</span> Mass. 147, and <i>Mahoning County</i> v. <i>Young,</i> 16 U.S. App. 253, also cited by the defendant, likewise turned upon a question of forfeiture for breach of a condition subsequent in a deed to a municipal corporation.</p>",
[case_citation(volume="135", reporter="Mass.", page="147",
metadata={"plaintiff": "Crane",
"defendant": "Hyde Park"}
)],
{'clean_steps': ['html', 'inline_whitespace']})
)

# fmt: on
Expand Down
Loading