Abstract: In recent years, with the rapid development of deep learning technology, the Transformer model shows superior performance and is widely used in many fields such as natural language ...