[go: nahoru, domu]

Skip to content

Latest commit

 

History

History
 
 

decoding

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 

Aggressive Decoding

Lossless Acceleration for Seq2seq Generation with Aggressive Decoding. https://arxiv.org/pdf/2205.10350.pdf

  • May 2022: preprint arXiv released; code updated and integrated

Introduction

Aggressive Decoding, a novel decoding paradigm for lossless speedup of seq2seq generation. Unlike the previous efforts (e.g., non-autoregressive decoding) speeding up seq2seq generation at the cost of quality loss, Aggressive Decoding aims to yield the identical (or better) generation compared with autoregressive decoding but in a significant speedup: For the seq2seq tasks characterized by highly similar inputs and outputs (e.g., Grammatical Error Correction and Text Simplification), the Input-guided Aggressive Decoding (IAD) can introduce a 7x-9x speedup for the popular 6-layer Transformer on GPU with the identical results as greedy decoding; For other general seq2seq tasks (e.g., Machine Translation and Abstractive Summarization), the Generalized Aggressive Decoding (GAD) can have a 3x-5x speedup with the identical or even better quality.

Please check out IAD and GAD in the sub-folders.

Acknowledgement

This repository is built using the Fairseq repository.

License

This project is licensed under the license found in the LICENSE file in the root directory of this source tree.

Microsoft Open Source Code of Conduct

Contact Information

For other communications related to Aggressive Decoding, please contact Tao Ge (tage@microsoft.com).