Skip to content
srdee edited this page Jul 30, 2015 · 3 revisions

Part of the conversion process is converting HTML entities to their UTF-8 equivalents.

In the past, some of the files have been converted using the html2uni() function in the epidocConversions.scala script, while Perseus-specific entities were converted using find-and-replace functionality also in this script, plus currentTransforms.txt.

However, moving forward, HTML entity conversions should be completed using the Perseus DTDs in the DTD folder of the repo.

For more on DTDs, see the W3C DTD tutorial.

Clone this wiki locally