mirror of
https://github.com/SWivid/F5-TTS.git
synced 2025-12-12 15:50:07 -08:00
Update README
This commit is contained in:
11
README.md
11
README.md
@@ -1,16 +1,25 @@
|
||||
# F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching
|
||||
|
||||
<div style="position: relative; width: 100%;">
|
||||
<div style="position: absolute; top: 0; right: 0;">
|
||||
<img src="https://avatars.githubusercontent.com/u/35554183?s=200&v=4" alt="Watermark" style="width: 140px; height: auto;">
|
||||
</div>
|
||||
</div>
|
||||
|
||||
[](https://github.com/SWivid/F5-TTS)
|
||||
[](https://arxiv.org/abs/2410.06885)
|
||||
[](https://swivid.github.io/F5-TTS/)
|
||||
[](https://huggingface.co/spaces/mrfakename/E2-F5-TTS)
|
||||
[](https://x-lance.sjtu.edu.cn/)
|
||||
|
||||
**F5-TTS**: Diffusion Transformer with ConvNeXt V2, faster trained and inference.
|
||||
|
||||
**E2 TTS**: Flat-UNet Transformer, closest reproduction.
|
||||
**E2 TTS**: Flat-UNet Transformer, closest reproduction from [paper](https://arxiv.org/abs/2406.18009).
|
||||
|
||||
**Sway Sampling**: Inference-time flow step sampling strategy, greatly improves performance
|
||||
|
||||
### Thanks to all the contributors !
|
||||
|
||||
## Installation
|
||||
|
||||
Clone the repository:
|
||||
|
||||
Reference in New Issue
Block a user