-
Notifications
You must be signed in to change notification settings - Fork 13
Open
Description
User Stories
| # | As a... | I want to... | so that... |
|---|---|---|---|
| 1. | researcher | segment a collection of Tibetan texts | I can do statistics in AntConc |
| 2. | tibetan text proofreader | mark potential errors | I can catch and correct more mistakes |
| 3. | corpus researcher for amdo dialects | create several custom profiles | I can do statistics on different spoken dialects |
| 4. | corpus researcher on literary Tibetan | create a custom profile for the kangyur | I can do accurate statistics on the kangyur and tengyur |
| 5. | |||
Rule based segmentation steps (for story 3 & 4)
- Segment a volume with the default profile
- Create a word list from the volume, ordered by frequency
- Manually cleanup the wordlist
- Use the wordlist as the main list
- Segment the volume again
- Edit the custom profile (word /remove /adjustments) till the segmentation is good
- Merge custom profile with main profile
- Repeat with a second volume
Steps for story # & #
Metadata
Metadata
Assignees
Labels
No labels