converter from lomaji to hanji in Taiwanese (Hokkien)

pakkau - a lomaji to hanji Taiwanese (Hokkien) converter

An experimental Hidden Markov Model converter from lomaji to hanji for Taiwanese (Hokkien). Still in alpha.

Dependencies

  • Python3
  • Pandas

Help

usage: pakkau.py [-h] [--genmod] [--form FORM] [SENTENCE]

positional arguments:
  SENTENCE     the sentence to be converted

options:
  -h, --help   show this help message and exit
  --genmod     generate the model
  --form FORM  the orthography to be used (poj or tl). Default is poj. (not opened)

example1:

python3 ./pakkau.py --form tl "Iâ-soo kóng:guá sī sè-kan ê kng"

output:

耶穌講:我是世間的光
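
How a Hidden Markov Model can pick hanji for a run of lomaji syllables is sketched below with Viterbi decoding. This is only an illustration, not the code in pakkau.py: the function name, the table layout, the candidate lists and all probabilities are made-up toy values and are not taken from model.db.

```python
import math

def viterbi(observations, candidates, start_p, trans_p, emit_p, floor=1e-9):
    """Return the most probable hanji sequence for a list of lomaji syllables.

    All tables are hypothetical plain dicts; unseen events get a small
    floor probability so that math.log never receives zero.
    """
    if not observations:
        return []
    # best[i][hanji] = (log-probability of best path ending here, previous hanji)
    best = [{}]
    first = observations[0]
    for state in candidates[first]:
        score = (math.log(start_p.get(state, floor))
                 + math.log(emit_p.get((state, first), floor)))
        best[0][state] = (score, None)
    for i in range(1, len(observations)):
        obs = observations[i]
        best.append({})
        for state in candidates[obs]:
            emit = math.log(emit_p.get((state, obs), floor))
            # choose the predecessor that maximizes the path score
            score, prev = max(
                (best[i - 1][p][0] + math.log(trans_p.get((p, state), floor)) + emit, p)
                for p in best[i - 1]
            )
            best[i][state] = (score, prev)
    # backtrack from the best final state
    state = max(best[-1], key=lambda s: best[-1][s][0])
    path = [state]
    for i in range(len(observations) - 1, 0, -1):
        state = best[i][state][1]
        path.append(state)
    return path[::-1]

# Toy tables (illustrative numbers only).
candidates = {"guá": ["我"], "sī": ["是", "示"], "kng": ["光", "扛"]}
start_p = {"我": 1.0}
trans_p = {("我", "是"): 0.5, ("我", "示"): 0.001,
           ("是", "光"): 0.1, ("是", "扛"): 0.05}
emit_p = {("我", "guá"): 1.0, ("是", "sī"): 0.9, ("示", "sī"): 1.0,
          ("光", "kng"): 0.6, ("扛", "kng"): 0.8}

print("".join(viterbi(["guá", "sī", "kng"], candidates, start_p, trans_p, emit_p)))
```

With these toy numbers, the transition scores outweigh the higher emission score of 扛, so the decoder prefers 光 after 是.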

example2:

python3 ./pakkau.py --genmod

Generates the models from the .csv parallel transliteration files in ./corpus.
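
One way model generation from a parallel corpus could work is to count hanji/syllable pairs and hanji-to-hanji transitions, then normalize the counts into probabilities. The sketch below is an assumption about the general approach, not the actual --genmod code: the column names "hanji" and "lomaji", the one-hanji-per-syllable alignment, and both function names are hypothetical. It uses the standard csv module rather than Pandas to stay self-contained.

```python
import csv
import io
import re
from collections import Counter

def count_pairs(csv_text):
    """Count emissions (hanji, syllable) and transitions (hanji, next hanji)."""
    emit = Counter()
    trans = Counter()
    for row in csv.DictReader(io.StringIO(csv_text)):
        hanji = list(row["hanji"])
        # split lomaji on whitespace and hyphens to get one item per syllable
        syllables = [s for s in re.split(r"[\s-]+", row["lomaji"]) if s]
        if len(hanji) != len(syllables):
            continue  # skip rows that do not align one hanji per syllable
        for h, s in zip(hanji, syllables):
            emit[(h, s)] += 1
        for a, b in zip(hanji, hanji[1:]):
            trans[(a, b)] += 1
    return emit, trans

def normalize(counts):
    """Turn pair counts into conditional probabilities given the first item."""
    totals = Counter()
    for (first, _), n in counts.items():
        totals[first] += n
    return {pair: n / totals[pair[0]] for pair, n in counts.items()}

# Hypothetical corpus content; the last row is misaligned and gets skipped.
sample = "hanji,lomaji\n我是光,guá sī kng\n世間,sè-kan\n光明,kng\n"
emit, trans = count_pairs(sample)
print(normalize(emit))
print(normalize(trans))
```

Skipping misaligned rows keeps the sketch simple; a real corpus would likely need punctuation handling and multi-syllable words treated more carefully.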

Unfinished

  • POJ conversion
  • the precision of the conversion