pakkau/README.md

39 lines
862 B
Markdown
Raw Permalink Normal View History

2024-03-20 01:11:22 +08:00
### pakkau - a lomaji to hanji Taiwanese (Hokkien) converter
2024-03-20 01:03:27 +08:00
A test of Hidden Markov Model converter from lomaji to hanji of Taiwanese (Hokkien). still in alpha version.
## Dependencies
- Python3
- Pandas
## Help
2024-03-20 01:17:59 +08:00
```
2024-03-20 01:03:27 +08:00
usage: pakkau.py [-h] [--genmod] [--form FORM] [SENTENCE]
positional arguments:
SENTENCE the sentence to be converted
options:
-h, --help show this help message and exit
--genmod generate the model
--form FORM the orthography to be used (poj or tl). Default is poj. (not opened)
2024-03-20 01:17:59 +08:00
```
2024-03-20 01:03:27 +08:00
#### example1:
``
2024-03-20 01:11:22 +08:00
python3 ./pakkau.py --form tl "Lāu-su kóng: \"ta̍k-ke tsò-hué lâi\""
2024-03-20 01:03:27 +08:00
``
output:
2024-03-20 01:11:22 +08:00
老師講:"逐家做伙來"
2024-03-20 01:03:27 +08:00
#### example2:
``
python3 ./pakkau.py --genmod
``
generate models from the .csv parallel transliteration file in ./corpus files
## unfinished
2024-03-20 01:17:59 +08:00
- poj conversion
- the preciseness of the conversion