643 Commits

Author SHA1 Message Date
unknown
e636d98090 add transcribe function 2024-11-16 18:18:42 +02:00
unknown
96e24d3605 add 8bit and fix some value 2024-11-16 18:17:51 +02:00
unknown
1bbec4ac7e add 8bit and fix some value 2024-11-16 18:06:45 +02:00
Tyler Banks
829d265ee1 Removed redundant final chunk logic
Handle socket server edge case
2024-11-15 23:33:04 -05:00
SWivid
2a844ae064 minor update patch-1 2024-11-15 19:15:34 +08:00
SWivid
6f13ad4349 minor update 2024-11-15 18:50:35 +08:00
Yushen CHEN
ecbf6447dd Merge pull request #469 from JarodMica/main_repo_update
Allow for local path specification of HF models/repos
2024-11-15 18:22:06 +08:00
Jarod Mica
334367d070 Add JA model to SHARED.md 2024-11-15 02:10:33 -08:00
Jarod Mica
929b5ae313 run pre-commit 2024-11-15 01:50:20 -08:00
Jarod Mica
d1d8139bab Allow for local path specification of HF models/repos 2024-11-15 01:46:26 -08:00
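
The change above concerns accepting either a local path or a Hugging Face repo reference. A minimal sketch of that kind of check follows. It is not taken from the repository: `resolve_model_source` and its return convention are hypothetical names introduced only for illustration.

```python
import os


def resolve_model_source(path_or_repo: str):
    """Hypothetical helper: classify a string as a local path or a Hub repo id."""
    expanded = os.path.expanduser(path_or_repo)
    # If something exists on disk at this location, treat it as a local model.
    if os.path.exists(expanded):
        return ("local", os.path.abspath(expanded))
    # Otherwise assume the string names a repo to be fetched from the Hub.
    return ("hub", path_or_repo)
```

A caller could branch on the first element of the tuple to decide whether to load from disk or to download.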
Yushen CHEN
8f65f9f3e4 Update SHARED.md 2024-11-14 00:14:28 +08:00
SWivid
159797d7c4 formatting 2024-11-14 00:10:42 +08:00
Yushen CHEN
944d32862e Merge pull request #462 from RASPIAUDIO/patch-1
Shared Model Cards for french from @RASPIAUDIO
2024-11-13 23:26:05 +08:00
RASPIAUDIO
da00b90aae Update SHARED.md 2024-11-13 16:24:21 +01:00
RASPIAUDIO
e995c123e2 resources added 2024-11-13 15:35:18 +01:00
RASPIAUDIO
4577644f5c added french 2024-11-13 15:30:22 +01:00
SWivid
be53fb179c fix. force reload from custom cache file to override infer_gradio init loaded value in case webpage refreshed 2024-11-11 14:46:50 +08:00
Yushen CHEN
da059e86b2 Update SHARED.md; typo 2024-11-11 11:20:11 +08:00
SWivid
a23ec25b39 v0.1.1 2024-11-11 11:19:04 +08:00
SWivid
59761ec9f8 Update. Cache last used custom model path #447 2024-11-11 11:13:11 +08:00
Yushen CHEN
e6e7bd01d3 Merge pull request #448 from peanutcocktail/main
Set PYTORCH_ENABLE_MPS_FALLBACK=1 for MPS by default
2024-11-11 07:11:19 +08:00
cocktailpeanut
5635eac12d PYTORCH_ENABLE_MPS_FALLBACK 2024-11-10 15:36:22 -05:00
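
For context, `PYTORCH_ENABLE_MPS_FALLBACK` is an environment variable that PyTorch reads so that operators not implemented for MPS can fall back to the CPU. A minimal sketch of setting it by default is shown below. The helper name `enable_mps_fallback` is hypothetical, the sketch does not check whether MPS is actually in use, and it assumes the variable is set before `torch` is imported.

```python
import os


def enable_mps_fallback(env=None):
    """Hypothetical helper: default PYTORCH_ENABLE_MPS_FALLBACK to "1"."""
    env = os.environ if env is None else env
    # setdefault keeps any value the user has already exported.
    env.setdefault("PYTORCH_ENABLE_MPS_FALLBACK", "1")
    return env["PYTORCH_ENABLE_MPS_FALLBACK"]
```

Using `setdefault` rather than plain assignment means an explicit user setting such as `0` is left untouched.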
SWivid
3fcdbc70b4 minor update 2024-11-09 04:54:06 +08:00
SWivid
dcd9a19889 v0.1.0. Add custom model support for local deploy; add share model cards, etc. 2024-11-09 04:35:44 +08:00
SWivid
23409af801 Update infer-gradio with ref_text auto-filling; minor fix 2024-11-09 01:14:54 +08:00
SWivid
c33a83c009 fix. better asr_transcript cache logic; global model select init. 2024-11-08 20:57:47 +08:00
Yushen CHEN
e78ae2ce92 Create config.yml 2024-11-07 14:23:52 +08:00
Yushen CHEN
da8853838f Update question.yml 2024-11-07 13:23:35 +08:00
Yushen CHEN
8a7e8495ff Merge pull request #401 from hcsolakoglu/fix-finetune-cli-bnb-optimizer
Add --bnb_optimizer argument to CLI and pass it to Trainer initialization
2024-11-05 20:20:31 +08:00
Yushen CHEN
dbe35da754 Update finetune_cli.py; formatting 2024-11-05 20:19:53 +08:00
hcsolakoglu
2033993f57 Add --bnb_optimizer argument to CLI and pass it to Trainer initialization
* Add `--bnb_optimizer` argument to `parse_args()` function in `src/f5_tts/train/finetune_cli.py`.
* Pass `bnb_optimizer` argument to `Trainer` initialization in the `main()` function of `src/f5_tts/train/finetune_cli.py`.
2024-11-05 15:11:37 +03:00
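
The two steps listed in that commit message (add the flag in `parse_args()`, then pass it to `Trainer` in `main()`) can be sketched as follows. This is an illustration under assumptions, not the contents of `finetune_cli.py`: the `Trainer` class here is a stand-in, and treating `--bnb_optimizer` as a boolean switch is a guess about the flag's form.

```python
import argparse


class Trainer:
    """Hypothetical stand-in for the project's Trainer; only the flag matters here."""

    def __init__(self, bnb_optimizer: bool = False):
        self.bnb_optimizer = bnb_optimizer


def parse_args(argv=None):
    parser = argparse.ArgumentParser(description="Fine-tuning CLI (sketch)")
    # Assumed to be a boolean switch; the real flag may take a different form.
    parser.add_argument(
        "--bnb_optimizer",
        action="store_true",
        help="Enable the bnb optimizer option (assumed meaning)",
    )
    return parser.parse_args(argv)


def main(argv=None):
    args = parse_args(argv)
    # Forward the parsed flag to the trainer at construction time.
    return Trainer(bnb_optimizer=args.bnb_optimizer)
```

Calling `main(["--bnb_optimizer"])` yields a trainer with the option enabled, while `main([])` leaves it off.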
SWivid
4a69e6bad2 formatting 2024-11-05 00:02:21 +08:00
Yushen CHEN
8718b1d821 Merge pull request #394 from lpscr/main
Fix the glitch effect at the beginning audio
2024-11-05 00:01:48 +08:00
Yushen CHEN
2f914140f4 Update utils_infer.py 2024-11-05 00:00:29 +08:00
unknown
e6f3e50eb1 Fix the glitch effect at the beginning audio 2024-11-04 17:05:16 +02:00
Yushen CHEN
b03e9b2952 Merge pull request #389 from kunci115/main
Bug fix
2024-11-04 17:12:23 +08:00
Yushen CHEN
c1c20ed009 Update socket_server.py, to pass format check 2024-11-04 17:11:29 +08:00
Rino
24cfa9ecb9 Update README.md 2024-11-04 15:50:15 +07:00
Rino
c129dd7ba4 Rename socket.py to socket_server.py
[bug fix] due to circular import, can't use socket as file name
2024-11-04 15:48:09 +07:00
Rino
a83e764110 Update socket.py
[edit] adjusting mel_spec_type on load_model use case
2024-11-04 15:46:00 +07:00
SWivid
ac77a76cd3 add issue templates 2024-11-04 02:17:52 +08:00
SWivid
61ff2a62d9 formatting #363, credit to @JarodMica, also dur_pred check fork repo 2024-11-03 16:37:47 +08:00
Yushen CHEN
1085b73f59 Merge pull request #354 from kunci115/main
[add] socket stream
2024-11-03 16:24:57 +08:00
Rino
6e24f1ea78 Merge branch 'SWivid:main' into main 2024-11-03 11:40:25 +07:00
SWivid
ea90244d62 fix. add dtype check for asr pipeline addressing #356 2024-11-02 13:48:37 +08:00
SWivid
f7e248e2ce formatting 2024-11-02 12:58:28 +08:00
Zhikang Niu
dc67a6819c Merge pull request #367 from justinjohn0306/main
Ensure tensors are moved to CPU before saving with torchaudio
2024-11-02 11:15:41 +08:00
Rino
0fe34a862c Merge branch 'SWivid:main' into main 2024-11-02 01:54:18 +07:00
Justin John
183ad09084 Ensure tensors are moved to CPU before saving with torchaudio 2024-11-01 23:47:00 +05:30
SWivid
b0f482421b fix-patch-2 for #361 2024-11-01 19:14:46 +08:00