Add encoding cache and lazy-train mechanism by chengfx · Pull Request #50 · microsoft/NeuronBlocks

chengfx · 2019-05-24T03:34:36Z

There are two major updates in this PR.

Encoding Cache
Now encoding cache mechanism is supported. It's a lazy-build progress. So we will build it in the first epoch and then reuse it in the rest epoch.
Lazy Train
We will load chunk_size cases to train in every part of an epoch avoiding out of memory in large training data

…lassifier

…ry Binary Classifier

chengfx · 2019-05-28T11:27:27Z

tasks                       GPU/CPU           old accuracy/AUC        accuracy/new AUC
english_text_matching           GPU             0.96655                 0.967
english_text_matching           CPU             0.96655                 0.967
chinese_text_matching           GPU             0.70001                 0.7
chinese_text_matching           CPU             0.70001                 0.7
quora_question_pairs            GPU             0.72596                 0.72086
quora_question_pairs            CPU             0.72596                 0.72086
knowledge_distillation          CPU             0.66329                 0.6628427083333334

… into dev/fecheng

…te tutorials

chengfx · 2019-08-01T09:27:32Z

New auto-test result @adolphk-yk

tasks			     GPU/CPU	old accuracy/AUC	new accuracy/AUC 
english_text_matching		GPU		0.96655			0.97075
english_text_matching		CPU		0.96655			0.97075
chinese_text_matching		GPU		0.70001			0.7
chinese_text_matching		CPU		0.70001			0.7
quora_question_pairs		GPU		0.72596			0.721861
quora_question_pairs		CPU		0.72596			0.721861
knowledge_distillation		CPU		0.66329			0.6580784722222223

Feixiang Cheng and others added 30 commits April 25, 2019 22:09

Add new config about knowledge distillation for query binary classifier

674526d

remove inferenced result in knowledge distillation for query binary c…

59d6318

…lassifier

Add AUC.py in tools folder

b4c110e

Add test_data_path into conf_kdqbc_bilstmattn_cnn.json

891f43a

Modify AUC.py

8b1d100

Rename AUC.py into calculate_AUC.py

333bd98

Merge branch 'master' into dev/fecheng

b6523a7

Modify test&calculate AUC commands for Knowledge Distillation for Que…

74976c2

…ry Binary Classifier

Merge branch 'master' into dev/fecheng

936d9fe

Add cpu_thread_num parameter in conf.training_params

8c6e61b

Rename cpu_thread_num into cpu_num_workers

69c0bca

update comments in ModelConf.py

fb11aba

Add cup_num_workers in model_zoo/advanced/conf.json

bbfcde2

Add the description of cpu_num_workers in Tutorial.md

153acd3

fix conflict

4c9380c

Merge branch 'master' into dev/fecheng

2ae9d4a

Update inference speed of compressed model

cff4cd3

Add ProcessorsScheduler Class

cf534ce

Merge branch 'master' into dev/fecheng

37d09d5

Add license in ProcessorScheduler.py

17b8447

use lazy loading instead of one-off loading

e087427

merge master

1fb0440

Remove Debug Info in problem.py

05ddcf8

use open instead of codecs.open

af6ea60

Merge branch 'master' into dev/fecheng

535649e

update the inference of build dictionary for classification

fb4e47b

add md5 function in common_utils.py

a3a0c25

add merge_encode_* function

889aa91

update typo

bab7f54

update typo

576b88d

chengfx added 4 commits May 23, 2019 21:42

add file_column_num in problem.py

25fa9f6

merge add_encoding_cache branch

d1321df

merge add_encoding_cache branch

ddb1f40

add SST-2 in .gitignore

026d8f0

chengfx requested review from adolphk-yk, ericwtlin, ljshou and woailaosang May 24, 2019 03:34

chengfx mentioned this pull request May 27, 2019

how to deal the large data when training model #53

Closed

chengfx added 4 commits May 27, 2019 16:53

merge master

27ff591

merge master

35693da

Merge branch 'master' into dev/fecheng

cd9ebdc

Merge branch 'master' into dev/fecheng

e2d295e

chengfx and others added 12 commits July 2, 2019 17:51

merge master

a876e90

use steps_per_validation instead of valid_times_per_epoch

4e068ca

Fix Learning Rate decay logic bug

751858f

add log of calculating md5 of training data

d0dd01a

fix multi-gpu char_emb OOM problem & add char leval fix_lengths

c172913

Merge branch 'dev/fecheng' of https://github.com/Microsoft/NeuronBlocks…

f2afd30

… into dev/fecheng

Modify batch_num_to_show_results in multi-gpu

dda750a

Merge branch 'dev/fecheng' of https://github.com/Microsoft/NeuronBlocks…

1357877

… into dev/fecheng

Modify batch_num_to_show_results

bc4c12b

delete deepcopy in get_batches

776c80b

Merge branch 'dev/fecheng' of https://github.com/Microsoft/NeuronBlocks…

2058521

… into dev/fecheng

add new parameters chunk_size and max_building_lines in conf and upda…

187a1aa

…te tutorials

adolphk-yk approved these changes Aug 2, 2019

View reviewed changes

ljshou merged commit 58ad563 into master Aug 2, 2019

ljshou deleted the dev/fecheng branch August 2, 2019 12:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add encoding cache and lazy-train mechanism#50

Add encoding cache and lazy-train mechanism#50
ljshou merged 69 commits intomasterfrom
dev/fecheng

chengfx commented May 24, 2019

Uh oh!

chengfx commented May 28, 2019

Uh oh!

chengfx commented Aug 1, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

chengfx commented May 24, 2019

Uh oh!

chengfx commented May 28, 2019

Uh oh!

chengfx commented Aug 1, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants