converter from lomaji to hanji in Taiwanese (Hokkien)
Find a file
2024-06-28 21:55:08 +08:00
corpus add change log 2. change the converting unit from hanji, etc. 2024-06-28 21:55:08 +08:00
.gitignore Initial commit 2024-03-20 01:01:45 +08:00
gitignore add change log 2. change the converting unit from hanji, etc. 2024-06-28 21:55:08 +08:00
LICENSE initial 2024-03-20 01:07:07 +08:00
model.db add change log 2. change the converting unit from hanji, etc. 2024-06-28 21:55:08 +08:00
pakkau.py add change log 2. change the converting unit from hanji, etc. 2024-06-28 21:55:08 +08:00
pakkau.py~ add change log 2. change the converting unit from hanji, etc. 2024-06-28 21:55:08 +08:00
README.md readme 2024-03-20 01:11:22 +08:00
result.json initial 2024-03-20 01:07:07 +08:00
test2.py~ add change log 2. change the converting unit from hanji, etc. 2024-06-28 21:55:08 +08:00
test3.py~ add change log 2. change the converting unit from hanji, etc. 2024-06-28 21:55:08 +08:00

pakkau - a lomaji to hanji Taiwanese (Hokkien) converter

A test of Hidden Markov Model converter from lomaji to hanji of Taiwanese (Hokkien). still in alpha version.

Dependencies

  • Python3
  • Pandas

Help

usage: pakkau.py [-h] [--genmod] [--form FORM] [SENTENCE]

positional arguments: SENTENCE the sentence to be converted

options: -h, --help show this help message and exit --genmod generate the model --form FORM the orthography to be used (poj or tl). Default is poj. (not opened)

example1:

python3 ./pakkau.py --form tl "Lāu-su kóng: \"ta̍k-ke tsò-hué lâi\"" output:

老師講:"逐家做伙來"

example2:

python3 ./pakkau.py --genmod generate models from the .csv parallel transliteration file in ./corpus files

unfinished

poj conversion the preciseness of the conversion