Right now, we segment words into morphs. But in Czech, many morphs are allomorphic. If we could find allomorphs of a single morpheme and mark them as such, i.e. tag them with the morpheme name, the tool could be more useful to actual linguists.
We could do this by clustering the morphs according to their orthography and possibly neighbors.