pakkau

converter from lomaji to hanji in Taiwanese (Hokkien)

Find a file

Tan Kian-ting 9f2836067a add change log 2. change the converting unit from hanji, etc.		2024-06-28 21:55:08 +08:00
corpus	add change log 2. change the converting unit from hanji, etc.	2024-06-28 21:55:08 +08:00
.gitignore	Initial commit	2024-03-20 01:01:45 +08:00
gitignore	add change log 2. change the converting unit from hanji, etc.	2024-06-28 21:55:08 +08:00
LICENSE	initial	2024-03-20 01:07:07 +08:00
model.db	add change log 2. change the converting unit from hanji, etc.	2024-06-28 21:55:08 +08:00
pakkau.py	add change log 2. change the converting unit from hanji, etc.	2024-06-28 21:55:08 +08:00
pakkau.py~	add change log 2. change the converting unit from hanji, etc.	2024-06-28 21:55:08 +08:00
README.md	readme	2024-03-20 01:11:22 +08:00
result.json	initial	2024-03-20 01:07:07 +08:00
test2.py~	add change log 2. change the converting unit from hanji, etc.	2024-06-28 21:55:08 +08:00
test3.py~	add change log 2. change the converting unit from hanji, etc.	2024-06-28 21:55:08 +08:00

README.md

pakkau - a lomaji to hanji Taiwanese (Hokkien) converter

A test of Hidden Markov Model converter from lomaji to hanji of Taiwanese (Hokkien). still in alpha version.

Dependencies

Python3
Pandas

Help

usage: pakkau.py [-h] [--genmod] [--form FORM] [SENTENCE]

positional arguments: SENTENCE the sentence to be converted

options: -h, --help show this help message and exit --genmod generate the model --form FORM the orthography to be used (poj or tl). Default is poj. (not opened)

example1:

python3 ./pakkau.py --form tl "Lāu-su kóng: \"ta̍k-ke tsò-hué lâi\"" output:

老師講："逐家做伙來"

example2:

python3 ./pakkau.py --genmod generate models from the .csv parallel transliteration file in ./corpus files

unfinished

poj conversion the preciseness of the conversion