AST with Transfer Learning (playground)

Attention-based sequence to sequence AST System. The model uses ASR and MT Systems for initialization (Transfer Learning).

The project is based on the code of Alexandre Bérard https://github.com/eske/seq2seq

## Getting Started

### Prerequisites and Installing

1. Follow the instructions https://github.com/eske/seq2seq to install the dependencies and the seq2seq system.
2. Download the folder cluster/home/proj/speechrecognition/TRANSFER_LEARNING_AST 

### Input Data

* Used Input Data (given):

cluster/home/proj/speech_data_models/data


* Used pre-trained ASR and MT Models (given)

ASR ("best-276000" was used for initialisation):

cluster/home/proj/speechrecognition/TRANSFER_LEARNING_AST/models/ASR

MT ("best-98000" was used for initialisation):

cluster/home/proj/speechrecognition/TRANSFER_LEARNING_AST/models/MT


## Learn the system
## READY-TO-USE MODEL:

 **Our ready-to-use best model:**

/home/proj/speechrecognition/TRANSFER_LEARNING_AST/models2/best.index

* Other checkpoints and log files: 

--- First 220000 steps (srarting from 98000 step):

cluster/home/proj/speechrecognition/TRANSFER_LEARNING_AST/models

--- Next steps:

/home/proj/speechrecognition/TRANSFER_LEARNING_AST/models2/

* Configuration files:

--- First 220000 steps (starting from step 98000):

cluster/home/proj/speechrecognition/TRANSFER_LEARNING_AST/transfer_1.yaml

--- Next steps:

cluster/home/proj/speechrecognition/TRANSFER_LEARNING_AST/transfer_2.yaml

## TRAIN YOUR OWN MODEL:

- Create new output folder, change directions/hyperparameters/checkpoints in transfer_1.yaml configuration file.

- Change to your seq2seq folder, run the following script:

./seq2seq.sh CONFIG --train -v

...where CONFIG is the direction to the transfer_1.yaml file on your computer.


## Other possobilities:

* Input Data produced by our group:
tar audio files:

cluster/home/proj/speechrecognition/archiv

npz, vocab und spain textfiles:

cluster/home/students/chernenko/speech_recognition/features

* To generate your own input Data with scripts of our group:
cluster/home/proj/speechrecognition/seq2seq

1_dataset_devide.py 2_change_format.sh 3_split_files.py 4_extract.py (original script - seq2seq/scripts/speech/extract.py)


* Use other versions of pre-trained ASR and MT Models (new ones):

cluster/home/proj/speech_data_models/models/


## Built With

* [Alexandre Bérard] https://github.com/eske/seq2seq) 

## Authors

* **Chernenko Tetyana (Tatjana)** 
* **Liang Siting**
{chernenko/liang}@cl.uni-heidelberg.de

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
1_dataset_devide.py		1_dataset_devide.py
2_change_format.sh		2_change_format.sh
3_split_files.py		3_split_files.py
4_extract_mfcc_yaafe.py		4_extract_mfcc_yaafe.py
AST_Transfer_Learning_report.pdf		AST_Transfer_Learning_report.pdf
README.md		README.md
config.json		config.json
config.py		config.py
requerements.txt		requerements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AST with Transfer Learning (playground)

About

Releases

Packages

Languages

TatjanaChernenko/automatic_speech_translation_transfer_learning

Folders and files

Latest commit

History

Repository files navigation

AST with Transfer Learning (playground)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages