Fhrozen commited on
Commit
1025952
1 Parent(s): c451c1f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -3
README.md CHANGED
@@ -12,8 +12,10 @@ pinned: false
12
  ESPnet is an end-to-end speech processing toolkit covering end-to-end speech recognition, text-to-speech, speech translation, speech enhancement, speaker diarization, spoken language understanding, and so on.
13
  ESPnet uses [pytorch](http://pytorch.org/) as a deep learning engine and also follows [Kaldi](http://kaldi-asr.org/) style data processing, feature extraction/format, and recipes to provide a complete setup for various speech processing experiments.
14
 
15
- <details><summary>Citations</summary>
16
 
 
 
 
17
  @inproceedings{watanabe2018espnet,
18
  author={Shinji Watanabe and Takaaki Hori and Shigeki Karita and Tomoki Hayashi and Jiro Nishitoba and Yuya Unno and Nelson {Enrique Yalta Soplin} and Jahn Heymann and Matthew Wiesner and Nanxin Chen and Adithya Renduchintala and Tsubasa Ochiai},
19
  title={{ESPnet}: End-to-End Speech Processing Toolkit},
@@ -23,6 +25,7 @@ ESPnet uses [pytorch](http://pytorch.org/) as a deep learning engine and also fo
23
  doi={10.21437/Interspeech.2018-1456},
24
  url={http://dx.doi.org/10.21437/Interspeech.2018-1456}
25
  }
 
26
  @inproceedings{hayashi2020espnet,
27
  title={{Espnet-TTS}: Unified, reproducible, and integratable open source end-to-end text-to-speech toolkit},
28
  author={Hayashi, Tomoki and Yamamoto, Ryuichi and Inoue, Katsuki and Yoshimura, Takenori and Watanabe, Shinji and Toda, Tomoki and Takeda, Kazuya and Zhang, Yu and Tan, Xu},
@@ -31,6 +34,7 @@ ESPnet uses [pytorch](http://pytorch.org/) as a deep learning engine and also fo
31
  year={2020},
32
  organization={IEEE}
33
  }
 
34
  @inproceedings{inaguma-etal-2020-espnet,
35
  title = "{ESP}net-{ST}: All-in-One Speech Translation Toolkit",
36
  author = "Inaguma, Hirofumi and
@@ -48,6 +52,7 @@ ESPnet uses [pytorch](http://pytorch.org/) as a deep learning engine and also fo
48
  url = "https://www.aclweb.org/anthology/2020.acl-demos.34",
49
  pages = "302--311",
50
  }
 
51
  @inproceedings{li2020espnet,
52
  title={{ESPnet-SE}: End-to-End Speech Enhancement and Separation Toolkit Designed for {ASR} Integration},
53
  author={Chenda Li and Jing Shi and Wangyou Zhang and Aswin Shanmugam Subramanian and Xuankai Chang and Naoyuki Kamo and Moto Hira and Tomoki Hayashi and Christoph Boeddeker and Zhuo Chen and Shinji Watanabe},
@@ -56,11 +61,11 @@ ESPnet uses [pytorch](http://pytorch.org/) as a deep learning engine and also fo
56
  year={2021},
57
  organization={IEEE},
58
  }
 
59
  @article{arora2021espnet,
60
  title={ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet},
61
  author={Arora, Siddhant and Dalmia, Siddharth and Denisov, Pavel and Chang, Xuankai and Ueda, Yushi and Peng, Yifan and Zhang, Yuekai and Kumar, Sujay and Ganesan, Karthik and Yan, Brian and others},
62
  journal={arXiv preprint arXiv:2111.14706},
63
  year={2021}
64
  }
65
-
66
- </details>
 
12
  ESPnet is an end-to-end speech processing toolkit covering end-to-end speech recognition, text-to-speech, speech translation, speech enhancement, speaker diarization, spoken language understanding, and so on.
13
  ESPnet uses [pytorch](http://pytorch.org/) as a deep learning engine and also follows [Kaldi](http://kaldi-asr.org/) style data processing, feature extraction/format, and recipes to provide a complete setup for various speech processing experiments.
14
 
 
15
 
16
+ ## Citations
17
+
18
+ ```BibTex
19
  @inproceedings{watanabe2018espnet,
20
  author={Shinji Watanabe and Takaaki Hori and Shigeki Karita and Tomoki Hayashi and Jiro Nishitoba and Yuya Unno and Nelson {Enrique Yalta Soplin} and Jahn Heymann and Matthew Wiesner and Nanxin Chen and Adithya Renduchintala and Tsubasa Ochiai},
21
  title={{ESPnet}: End-to-End Speech Processing Toolkit},
 
25
  doi={10.21437/Interspeech.2018-1456},
26
  url={http://dx.doi.org/10.21437/Interspeech.2018-1456}
27
  }
28
+
29
  @inproceedings{hayashi2020espnet,
30
  title={{Espnet-TTS}: Unified, reproducible, and integratable open source end-to-end text-to-speech toolkit},
31
  author={Hayashi, Tomoki and Yamamoto, Ryuichi and Inoue, Katsuki and Yoshimura, Takenori and Watanabe, Shinji and Toda, Tomoki and Takeda, Kazuya and Zhang, Yu and Tan, Xu},
 
34
  year={2020},
35
  organization={IEEE}
36
  }
37
+
38
  @inproceedings{inaguma-etal-2020-espnet,
39
  title = "{ESP}net-{ST}: All-in-One Speech Translation Toolkit",
40
  author = "Inaguma, Hirofumi and
 
52
  url = "https://www.aclweb.org/anthology/2020.acl-demos.34",
53
  pages = "302--311",
54
  }
55
+
56
  @inproceedings{li2020espnet,
57
  title={{ESPnet-SE}: End-to-End Speech Enhancement and Separation Toolkit Designed for {ASR} Integration},
58
  author={Chenda Li and Jing Shi and Wangyou Zhang and Aswin Shanmugam Subramanian and Xuankai Chang and Naoyuki Kamo and Moto Hira and Tomoki Hayashi and Christoph Boeddeker and Zhuo Chen and Shinji Watanabe},
 
61
  year={2021},
62
  organization={IEEE},
63
  }
64
+
65
  @article{arora2021espnet,
66
  title={ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet},
67
  author={Arora, Siddhant and Dalmia, Siddharth and Denisov, Pavel and Chang, Xuankai and Ueda, Yushi and Peng, Yifan and Zhang, Yuekai and Kumar, Sujay and Ganesan, Karthik and Yan, Brian and others},
68
  journal={arXiv preprint arXiv:2111.14706},
69
  year={2021}
70
  }
71
+ ```