Some problems when I train the model #9
Comments
Hi, this looks like a problem with data feeding.
There is a piece of bad code in data_gen.py:
|
Hi @MichalBusta @ustczhouyu, I ran into the same issue you asked about, and I solved it by commenting out the following lines associated with dg_ocr:
I think the main reason is that the two threads, 'dg_ocr' and 'data_generator', conflict with each other in each training epoch. @MichalBusta, do you have another approach to solve this problem? |
Hi, it is good to know that you have synthesized a multilingual dataset, the Synthetic Multi-Language in Natural Scene Dataset. I don't know how to download it. Could you send it to me? Thank you very much.
At 2018-11-28 21:35:55, "Michal Busta" <notifications@github.com> wrote:
Hi, this looks like a problem with data feeding.
You can try the -debug=1 flag to see the training data.
There is a piece of bad code in data_gen.py:
    if not os.path.exists(im_name):
        continue
    im = cv2.imread(im_name)
    if im is None:
        continue
|
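The snippet quoted above skips any image that is missing or unreadable, so a training list whose paths do not resolve can leave the reader skipping every sample without printing anything. A minimal sketch of a pre-flight check follows. It is not part of the repository: the function name `find_missing_images` is hypothetical, and it assumes one image path per line (optionally followed by comma-separated fields, as in the gt.txt example) with relative paths resolved against the list file's directory.

```python
import os


def find_missing_images(list_path, base_dir=None):
    """Return the image paths from a training list that do not exist on disk.

    Assumptions (not taken from the repository): one entry per line, the image
    path is the first comma-separated field, and relative paths are resolved
    against the directory that contains the list file.
    """
    if base_dir is None:
        base_dir = os.path.dirname(os.path.abspath(list_path))
    missing = []
    with open(list_path, encoding="utf-8") as f:
        for line in f:
            entry = line.strip()
            if not entry:
                continue  # ignore blank lines
            im_name = entry.split(",")[0].strip()
            if not os.path.isabs(im_name):
                im_name = os.path.join(base_dir, im_name)
            if not os.path.exists(im_name):
                missing.append(im_name)
    return missing
```

Running it on both lists before training, for example `find_missing_images("sample_train_data/MLT/trainMLT.txt")`, shows whether the skip branch would be taken for every entry. It only checks existence; a file that exists but cannot be decoded by cv2.imread would still need a separate check.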
See the Data section of https://github.com/MichalBusta/E2E-MLT |
@ustczhouyu @MiZhangWhuer @MichalBusta Hi, I have the same problem. I made the changes described above, but the error still occurs. I hope you can suggest a solution. I look forward to your reply. Thank you. |
@ycjcy @MichalBusta If you are running the sample data provided in the repository, try setting -batch_size=2. I noticed that with -batch_size=8 it would never hit the terminating case. |
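One way a batching loop can fail to reach its terminating condition is when it only yields once a batch is full and every sample is skipped. The sketch below illustrates a guard against that situation. It is not the repository's generator: `batches` and its `load` parameter are hypothetical names, and the default loader only checks that the file exists, where real code would call cv2.imread.

```python
import os


def batches(image_list, batch_size, load=None):
    """Yield lists of `batch_size` loaded samples, cycling over image_list.

    Raises RuntimeError instead of looping forever when a full pass over
    image_list produces no usable sample.
    """
    if load is None:
        # Placeholder loader (assumption): return the path if the file exists.
        load = lambda p: p if os.path.exists(p) else None
    batch = []
    while True:
        loaded_this_pass = 0
        for im_name in image_list:
            im = load(im_name)
            if im is None:
                continue  # skip unreadable samples
            loaded_this_pass += 1
            batch.append(im)
            if len(batch) == batch_size:
                yield batch
                batch = []
        if loaded_this_pass == 0:
            raise RuntimeError("no readable image in a full pass over the list")
```

With this guard, a list in which no path resolves fails on the first `next()` call rather than leaving the training script waiting without output.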
@MichalBusta @MiZhangWhuer @ycjcy @LittlePinkRobin @ustczhouyu Hello everyone! What is the function of "-ocr_feed_list" in train.py? And where can I get the cropped images? Thanks |
I am using ICDAR2015 to train the model.
The file sample_train_data/MLT/trainMLT.txt lists ICDAR2015 localization training images such as icdar-2015-Ch4/Train/img_1.jpg, and sample_train_data/MLT_CROPS/gt.txt lists ICDAR2015 recognition training images such as word_1.png, "Genaxis Theatre".
I have not changed any other paths. I train the model with:
python3 train.py -train_list=sample_train_data/MLT/trainMLT.txt -batch_size=8 -num_readers=5 -debug=0 -input_size=512 -ocr_batch_size=256 -ocr_feed_list=sample_train_data/MLT_CROPS/gt.txt
The output is:
root@10ca3ad2a7d1:/home/zy/jupyter/recognition/spotter/E2E-MLT-master# python3 train.py -train_list=sample_train_data/MLT/trainMLT.txt -batch_size=8 -num_readers=5 -debug=0 -input_size=512 -ocr_batch_size=256 -ocr_feed_list=sample_train_data/MLT_CROPS/gt.txt
Using E2E-MLT
loading model from e2e-mlt.h5
e2e-mlt.h5
1000 training images in sample_train_data/MLT/trainMLT.txt
1000 training images in sample_train_data/MLT/trainMLT.txt
1000 training images in sample_train_data/MLT/trainMLT.txt
1000 training images in sample_train_data/MLT/trainMLT.txt
1000 training images in sample_train_data/MLT/trainMLT.txt
4468 training images in sample_train_data/MLT_CROPS/gt.txt
4468 training images in sample_train_data/MLT_CROPS/gt.txt
I waited for half an hour, but there was no more output. Can you help me? Thank you.
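A silent wait like this is easier to diagnose if the consumer of the data queue fails after a timeout instead of blocking indefinitely. The following is a minimal sketch under the assumption that batches arrive through a standard `queue.Queue` (or a `multiprocessing.Queue`, which raises the same `queue.Empty` exception); `get_batch` and `timeout_s` are hypothetical names and do not come from train.py.

```python
import queue


def get_batch(data_queue, timeout_s=60.0):
    """Fetch the next batch, failing loudly if the producer has stalled."""
    try:
        return data_queue.get(timeout=timeout_s)
    except queue.Empty:
        raise TimeoutError(
            f"no batch received within {timeout_s} s; "
            "the data reader may be stuck or skipping every sample"
        ) from None
```

A timeout turns "no more output" into an explicit error that points at the data readers, which is where the replies in this thread locate the problem.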