forked from data-science-on-aws/data-science-on-aws
Commit
Showing 12 changed files with 621 additions and 605 deletions.
576 changes: 232 additions & 344 deletions
06_train/04_Train_Reviews_BERT_TensorFlow2_ScriptMode.ipynb
Large diffs are not rendered by default.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,11 @@ | ||
# Add python dependencies here... | ||
scikit-learn==0.20.3 | ||
nltk==3.4.5 | ||
# SageMaker bug that requires us to do this in the code directly. | ||
#scikit-learn==0.20.3 | ||
#tensorflow-hub==0.7.0 | ||
#bert-tensorflow==1.0.2 | ||
|
||
tensorflow==2.2.0-rc1 | ||
grpcio | ||
tqdm | ||
bert-for-tf2 | ||
sentencepiece |
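The comment in this hunk says a SageMaker bug forces the pinned dependencies to be installed from the code rather than from the requirements file. The commit does not show how that is done, so the following is only a minimal sketch of one common approach: calling pip through the running interpreter at the top of the training script. The helper names `build_pip_command` and `install` are hypothetical, and the three commented-out pins are used purely as an illustration.

```python
import subprocess
import sys

# Pins copied from the commented-out lines of the requirements file.
# Assumption: these are the packages the comment refers to.
PINNED = [
    "scikit-learn==0.20.3",
    "tensorflow-hub==0.7.0",
    "bert-tensorflow==1.0.2",
]


def build_pip_command(packages):
    """Build a pip invocation bound to the interpreter running this script."""
    return [sys.executable, "-m", "pip", "install", *packages]


def install(packages, dry_run=False):
    """Install packages with pip; with dry_run=True only return the command."""
    cmd = build_pip_command(packages)
    if not dry_run:
        # check_call raises CalledProcessError if pip exits with a non-zero code.
        subprocess.check_call(cmd)
    return cmd


# Hypothetical usage at the top of a training script, before the imports
# that depend on these packages:
#     install(PINNED)
```

Using `sys.executable -m pip` rather than a bare `pip` keeps the install tied to the same Python environment that will later import the packages.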
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,4 @@ | ||
python copy_data_locally.py | ||
|
||
SM_CHANNEL_TRAIN=feature-store/amazon-reviews/csv/balanced-tfidf-without-header/train SM_CHANNEL_VALIDATION=feature-store/amazon-reviews/csv/balanced-tfidf-without-header/validation SM_MODEL_DIR=. python xgboost_reviews.py | ||
# --num-rounds=10 | ||
rm -rf model/ | ||
rm -rf output/ | ||
#SM_INPUT_DATA_CONFIG={\"train\":{\"TrainingInputMode\":\"Pipe\"}} | ||
SM_CURRENT_HOST=blah SM_NUM_GPUS=0 SM_HOSTS={\"hosts\":\"blah\"} SM_MODEL_DIR=model/ SM_OUTPUT_DATA_DIR=output/ SM_CHANNEL_TRAIN=data/train SM_CHANNEL_VALIDATION=data/validation SM_CHANNEL_TEST=data/test python tf_bert_reviews.py --use-xla=False --use-amp=False --train-batch-size=8 --validation-batch-size=8 --test-batch-size=8 --epochs=2 --train-steps-per-epoch=10 --validation-steps=10 --test-steps=10 --max-seq-length=128 --freeze-bert-layer=False --enable-sagemaker-debugger=True |
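The local command above passes the SageMaker `SM_*` variables through the environment and the hyperparameters as `--flag=value` arguments. The body of `tf_bert_reviews.py` is not rendered in this diff, so the following is a hedged sketch of how a script-mode entry point might read a subset of them, not the author's implementation. The `str2bool` and `parse_args` helpers and the `--model-dir`, `--train-data`, and `--num-gpus` options are assumptions for illustration. An explicit boolean converter is used because `argparse` with `type=bool` would treat the string `"False"` as true.

```python
import argparse
import os


def str2bool(value):
    """Convert 'True'/'False' style strings into real booleans."""
    text = str(value).strip().lower()
    if text in ("true", "1", "yes"):
        return True
    if text in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def parse_args(argv=None, env=None):
    """Parse a subset of the flags and SM_* variables used in the command."""
    env = os.environ if env is None else env
    parser = argparse.ArgumentParser(allow_abbrev=False)
    parser.add_argument("--use-xla", type=str2bool, default=False)
    parser.add_argument("--use-amp", type=str2bool, default=False)
    parser.add_argument("--train-batch-size", type=int, default=8)
    parser.add_argument("--epochs", type=int, default=2)
    parser.add_argument("--max-seq-length", type=int, default=128)
    parser.add_argument("--freeze-bert-layer", type=str2bool, default=False)
    parser.add_argument("--enable-sagemaker-debugger", type=str2bool, default=False)
    # Hypothetical options whose defaults fall back to the SM_* environment.
    parser.add_argument("--model-dir", default=env.get("SM_MODEL_DIR", "model/"))
    parser.add_argument("--train-data", default=env.get("SM_CHANNEL_TRAIN", "data/train"))
    parser.add_argument("--num-gpus", type=int, default=int(env.get("SM_NUM_GPUS", "0")))
    # parse_known_args ignores flags this sketch does not declare.
    args, _unknown = parser.parse_known_args(argv)
    return args
```

Passing `env` explicitly makes the function easy to exercise locally without exporting the variables first, which mirrors what the shell command does inline.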
Large diffs are not rendered by default.
This file was deleted.
This file was deleted.
This file was deleted.
File renamed without changes.
File renamed without changes.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
# Add python dependencies here... | ||
scikit-learn==0.20.3 | ||
nltk==3.4.5 |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,4 @@ | ||
python copy_data_locally.py | ||
|
||
SM_CHANNEL_TRAIN=feature-store/amazon-reviews/csv/balanced-tfidf-without-header/train SM_CHANNEL_VALIDATION=feature-store/amazon-reviews/csv/balanced-tfidf-without-header/validation SM_MODEL_DIR=. python xgboost_reviews.py | ||
# --num-rounds=10 |
File renamed without changes.