GitHub - ilovin/lstm_ctc_ocr at c33361029ca4d2bc5908dc108bec417f7b3212e6

fonts
lib
lstm
.gitignore
README.md
test.sh
train.sh

master:
harder to converge compared to the beta version
supports both standard CTC and warpCTC
reads all data at once
dev:
the pipeline version of lstm_ctc_ocr; images are resized to the same size
beta:
generates data on the fly
handles multi-width images by padding them to the same width

How to use

Run python genImg.py to generate the training images in train/ and the validation set in test/. File names have the format 00000001_name.png, and the number of processes is set to 16.
cd standard or cd warpCTC
Run python lstm_ocr.py to start training.
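The file name format above can be illustrated with a small parsing sketch. It assumes that the part after the underscore is the text shown in the image, which the README does not state explicitly. The helper name label_from_filename is hypothetical and is not part of this repository.

```python
import os


def label_from_filename(path):
    """Split a name like '00000001_name.png' into (index, label).

    Assumption: the text after the first underscore is the label.
    """
    # Drop the directory and the extension, keeping e.g. '00000001_name'.
    stem = os.path.splitext(os.path.basename(path))[0]
    # Split on the first underscore only, so labels may contain underscores.
    index, _, label = stem.partition("_")
    return int(index), label


if __name__ == "__main__":
    print(label_from_filename("train/00000001_name.png"))
```

Splitting on the first underscore only keeps any later underscores as part of the label.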

Note that:

standard: uses tf.nn.ctc_loss to calculate the CTC loss
warpCTC: please install the warpCTC tensorflow_binding first
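For readers unfamiliar with what a CTC loss computes, here is a minimal pure-Python sketch of the CTC forward algorithm. It is an illustration only, not the implementation used by tf.nn.ctc_loss or warpCTC. The function name and the blank parameter are hypothetical, and the sketch works in plain probability space, so it is only suitable for short sequences (production implementations work in log space).

```python
import math


def ctc_neg_log_likelihood(probs, labels, blank):
    """Negative log-likelihood of `labels` under per-frame class probabilities.

    probs:  list of T rows, each a list of class probabilities summing to 1.
    labels: list of class ids, without blanks.
    blank:  class id used as the CTC blank symbol.
    """
    # Extended sequence: blank, l1, blank, l2, ..., blank.
    ext = [blank]
    for label in labels:
        ext += [label, blank]
    size = len(ext)

    # alpha[s] = total probability of all prefixes ending in ext[s] at frame t.
    alpha = [0.0] * size
    alpha[0] = probs[0][ext[0]]
    if size > 1:
        alpha[1] = probs[0][ext[1]]

    for t in range(1, len(probs)):
        new_alpha = [0.0] * size
        for s in range(size):
            total = alpha[s]
            if s >= 1:
                total += alpha[s - 1]
            # Skipping the blank is allowed only between two different labels.
            if s >= 2 and ext[s] != blank and ext[s] != ext[s - 2]:
                total += alpha[s - 2]
            new_alpha[s] = total * probs[t][ext[s]]
        alpha = new_alpha

    # Valid paths end on the last label or on the trailing blank.
    likelihood = alpha[-1] + (alpha[-2] if size > 1 else 0.0)
    return -math.log(likelihood)
```

With two frames, two classes (one label plus the blank) and uniform probabilities, three of the four possible paths collapse to the single label, so the likelihood is 0.75.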

Dependency

python 3
tensorflow 1.0.1
captcha
warpCTC tensorflow_binding

Some details

The training data:

Note that parameters can be found in ./lstm.yml (higher priority) and lib/lstm/utils/config.y
Some parameters need to be fine-tuned:

learning rate
decay step & decay rate
image_height
optimizer?
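To show how the decay step and decay rate interact with the learning rate, the sketch below assumes an exponential decay schedule with the same formula as tf.train.exponential_decay. The README does not say which schedule the repository uses, so treat this as an assumption. The function name and the example values are hypothetical.

```python
def decayed_learning_rate(base_lr, global_step, decay_steps, decay_rate,
                          staircase=False):
    """Exponential decay: base_lr * decay_rate ** (global_step / decay_steps)."""
    if staircase:
        # Decay in discrete jumps, once every `decay_steps` steps.
        exponent = global_step // decay_steps
    else:
        # Decay smoothly at every step.
        exponent = global_step / decay_steps
    return base_lr * decay_rate ** exponent


if __name__ == "__main__":
    # Example values only; they are not taken from lstm.yml.
    for step in (0, 1000, 2000, 4000):
        print(step, decayed_learning_rate(1e-3, step, 1000, 0.5))
```

A smaller decay step or a smaller decay rate makes the learning rate shrink faster, which is why the two are usually tuned together with the initial learning rate.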

In ./lib/lstm/utils/gen.py, all images have the same height, and the width is padded to the same value within each batch. If you want to use your own data, all of your images must therefore share the same height.
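The per-batch width padding can be sketched as follows. This is not the code from gen.py: it is a minimal illustration that uses nested lists as images, and the function name and pad_value parameter are hypothetical. It also returns the original widths, on the assumption that they are useful as sequence lengths for the CTC loss.

```python
def pad_batch_to_same_width(images, pad_value=0):
    """Right-pad every image in a batch to the width of the widest one.

    images: list of images, each a list of rows (lists of pixel values).
    Returns (padded_images, original_widths).
    """
    heights = {len(img) for img in images}
    if len(heights) != 1:
        raise ValueError("all images in a batch must share the same height")

    widths = [len(img[0]) for img in images]
    max_width = max(widths)
    padded = [
        [row + [pad_value] * (max_width - len(row)) for row in img]
        for img in images
    ]
    return padded, widths


if __name__ == "__main__":
    batch = [[[1, 2], [3, 4]], [[5, 6, 7], [8, 9, 10]]]
    print(pad_batch_to_same_width(batch))
```

The height check mirrors the requirement above: only the width may differ between images.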

Result

The accuracy can exceed 95%.

Read this blog for more details and this blog for how to use tf.nn.ctc_loss or warpCTC
