Skip to content

Need process to eliminate bad spaces #4

@amir-zeldes

Description

@amir-zeldes

Some abbreviations have inconsistent whitespace, for example spelling e. g. with space. The tokenizer should have some way of eliminating spaces in these based on a list in some file, possibly producing some annotation that indicates the original spelling (maybe sic+hi@rend="x-space"):

e.g.

Or adding an attribute with the original spelling (could do , though that is not really TEI)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions