IPA2TIPA

ipa2tipa is a script to convert International Phonetic Alphabet (IPA) into TeX IPA (TIPA), an IPA notation for $\LaTeX$.

Usage

Simple Usage

from ipa2tipa import IPA

# Create IPA string and convert to TIPA
ipa = IPA("ˈtʰiː")
tipa = ipa.to_tipa()
print(tipa)  # "t\super{h}i:

# IPA and TIPA are str subclasses
print(isinstance(ipa, str))    # True
print(isinstance(tipa, str))   # True
print(len(ipa))                # 4 (Unicode character count)

Examples

from ipa2tipa import IPA

# Aspiration and length
ipa = IPA("tʰiː")
print(ipa.to_tipa())  # t\super{h}i:

# Nasalization
ipa = IPA("nãɪ̃")
print(ipa.to_tipa())  # n\~{a}\~{I}

# Tone marks
ipa = IPA("tʰjɛn˧˥")
print(ipa.to_tipa())  # t\super{h}jEn\tone{35}

# String operations work as expected
ipa = IPA("ko̞ko̞")
print(ipa.upper())           # KO̞KO̞
print(ipa + " test")         # ko̞ko̞ test

Architecture

The library provides two main classes:

IPA(str) → .to_tipa() → TIPA(str)

Both IPA and TIPA are subclasses of str, representing actual IPA and TIPA strings respectively.

Components

Component	Description
`IPA`	IPA string class with `.to_tipa()` method
`TIPA`	TIPA string class (str subclass)
`UnicodeUnit`	Internal representation of a Unicode character with base and modifiers

UnicodeUnit Structure

UnicodeUnit is used internally to represent the structure of Unicode characters:

base: The base character codepoint (e.g., '0074' for 't')
modifiers: List of modifier codepoints (e.g., aspiration, nasalization)

Conversion Process

Internally, the conversion follows these steps:

Decompose IPA string into Unicode codepoints
Group codepoints into base + modifiers (UnicodeUnit)
Convert to TIPA notation

Files

Name	Content
`README.md`	what you are reading right now
`LICENSE.md`	this script is distributed under MIT License
`ipa2tipa.py`	main script
`ipa2tipa_test.py`	brief unittests implemented with a standard library `unittest`
`uni2tipa/uni2tipa-supsub.csv`	data in the format of `UTF-8 (hex), tipa macro denoting next super/subscript`
`uni2tipa/uni2tipa-tone.csv`	data in the format of `UTF-8 (hex), tipa macro of tone letters`
`uni2tipa/uni2tipa0.csv`	data in the format of `UTF-8 (hex), tipa macro taking 0 args`
`uni2tipa/uni2tipa1.csv`	data in the format of `UTF-8 (hex), tipa macro taking 1 arg`
`uni2tipa/uni2tipa2.csv`	data in the format of `UTF-8 (hex), tipa macro taking 2 args`

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

IPA2TIPA

Usage

Simple Usage

Examples

Architecture

Components

UnicodeUnit Structure

Conversion Process

Files

About

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
uni2tipa		uni2tipa
LICENSE.md		LICENSE.md
README.md		README.md
ipa2tipa.py		ipa2tipa.py
ipa2tipa_test.py		ipa2tipa_test.py
tipa2ipa.py		tipa2ipa.py
typedefs.py		typedefs.py

License

t92jp/ipa2tipa

Folders and files

Latest commit

History

Repository files navigation

IPA2TIPA

Usage

Simple Usage

Examples

Architecture

Components

UnicodeUnit Structure

Conversion Process

Files

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages