Commit Graph

  • c0d2f624c0 Release 20250625 main Jong Wook Kim 2025-06-25 18:05:47 -07:00
  • db7fbc75fe Release 20250625 Jong Wook Kim 2025-06-25 18:02:39 -07:00
  • 31243bad24 Release 20250625 Jong Wook Kim 2025-06-25 18:00:48 -07:00
  • 1f8fc975d3 Fix: Update torch.load to use weights_only=True to prevent security w… (#2451) Dridi Yassin 2025-06-26 02:54:30 +02:00
  • 679ae1d141 Fix: Ensure DTW cost tensor is on the same device as input tensor (#2561) Nathan Harmon 2025-06-25 18:42:09 -06:00
  • f50c4f264e docs: updated README to specify translation model limitation (#2547) Nicholas Nadeau, Ph.D., P.Eng. 2025-06-25 20:03:47 -04:00
  • 86899243e9 Fixed triton kernel update to support latest triton versions (#2588) ExtReMLapin 2025-06-26 02:02:54 +02:00
  • 5dff4db81a Fix: GitHub display errors for Jupyter notebooks (#2589) Learpcs 2025-06-26 03:55:15 +04:00
  • dd985ac4b9 Bump the github-actions group with 3 updates (#2592) dependabot[bot] 2025-05-13 11:22:31 -07:00
  • e1e6aa60ff Keep GitHub Actions up to date with GitHub's Dependabot (#2486) Christian Clauss 2025-05-13 20:10:43 +02:00
  • e6a5fc0ff0 pre-commit: Upgrade black v25.1.0 and isort v6.0.0 (#2514) Christian Clauss 2025-05-13 18:43:34 +02:00
  • 13907bed90 GitHub Actions: Add Python 3.13 to the testing (#2487) Christian Clauss 2025-05-13 06:10:40 +02:00
  • 517a43ecd1 Update python-publish.yml Jong Wook Kim 2025-01-04 12:56:16 -08:00
  • dd4d010d2c PEP 621: Migrate from setup.py to pyproject.toml (#2435) Christian Clauss 2025-01-04 10:38:35 +01:00
  • 26a7cacc83 pre-commit autoupdate && pre-commit run --all-files (#2484) Christian Clauss 2025-01-04 10:02:18 +01:00
  • 6c1d8f1ea1 Upgrade GitHub Actions (#2430) Christian Clauss 2025-01-04 09:47:12 +01:00
  • 90db0de189 Bugfix: Illogical "Avoid computing higher temperatures on no_speech" (#1903) Purfview 2024-12-01 05:47:01 +00:00
  • fc5ded7d90 Updating README and doc strings to reflect that n_mels can now be 128 (#2049) Lowell Vaughn 2024-11-26 09:37:01 -08:00
  • 173ff7dd1d fix typo data/README.md (#2433) f1sh 2024-11-13 08:35:54 +08:00
  • 271445b2f2 Update README.md (#2379) BotMaster3000 2024-11-04 08:00:30 +01:00
  • 5979f03701 Add option to carry initial_prompt with the sliding window (#2343) kittsil 2024-10-26 09:17:31 -05:00
  • cdb8147962 more pytorch versions in tests (#2408) Jong Wook Kim 2024-10-25 17:30:02 -07:00
  • 25639fc17d Release 20240930 Jong Wook Kim 2024-09-30 11:20:53 -07:00
  • 260bbcfcb3 allowing numpy 2 in tests (#2362) Jong Wook Kim 2024-09-30 11:18:17 -07:00
  • 25e5c364e0 large-v3-turbo model (#2361) Jong Wook Kim 2024-09-30 10:59:51 -07:00
  • b66b46f32d test on python/pytorch versions up to 3.12 and 2.4.1 (#2360) Jong Wook Kim 2024-09-30 10:33:56 -07:00
  • 27f971320a using sdpa if available (#2359) Jong Wook Kim 2024-09-30 10:27:14 -07:00
  • 423492dda7 Release 20240927 Jong Wook Kim 2024-09-27 16:43:58 -07:00
  • 279133e310 pinning numpy<2 in tests (#2332) Jong Wook Kim 2024-09-10 10:43:21 -07:00
  • 32d55d5d76 Relax triton requirements for compatibility with pytorch 2.4 and newer (#2307) Jianan Xing 2024-09-10 09:53:08 -07:00
  • ba3f3cd54b Skip silence around hallucinations (#1838) ryanheise 2023-12-19 07:11:16 +11:00
  • 8bc8860694 Fix triton env marker (#1887) Bob Lin 2023-12-11 23:39:08 +08:00
  • e58f288045 Release 20231117 Jong Wook Kim 2023-11-17 11:59:28 -08:00
  • 1cea435768 Relax triton requirements for compatibility with pytorch 2.1 and newer (#1802) Eugene Indenbom 2023-11-13 19:43:42 +02:00
  • fcfeaf1b61 Release 20231106 Jong Wook Kim 2023-11-06 10:14:04 -08:00
  • c5d4256076 large-v3 (#1761) Jong Wook Kim 2023-11-06 10:10:30 -08:00
  • f6f01c561c Release 20231105 Jong Wook Kim 2023-11-06 03:08:56 -08:00
  • 746aaaeafa remove tiktoken pin (#1759) Jong Wook Kim 2023-11-06 03:05:21 -08:00
  • b9f17e1f2d docs: Disambiguation of the term "relative speed" in the README (#1751) Philippe Hebert 2023-11-06 05:43:07 -05:00
  • 7dfcd56304 allow_pickle=False while loading of mel matrix IN audio.py (#1511) Mohamad Zamini 2023-11-06 03:28:51 -07:00
  • b7d277acd5 handling transcribe exceptions. (#1682) Marco Zucconelli 2023-11-06 11:06:19 +01:00
  • 6ed314fe41 Add new option to generate subtitles by a specific number of words (#1729) amosal 2023-11-06 10:49:33 +01:00
  • b38a1f20f4 Fix exception when an audio file with no speech is provided (#1396) Jordi Mas 2023-10-10 19:01:01 +02:00
  • 0a60fcaa9b Release 20230918 Jong Wook Kim 2023-09-18 17:13:19 -07:00
  • 5f957da5ca Update test.yml Jong Wook Kim 2023-09-18 16:38:17 -07:00
  • 8b330df096 Add .pre-commit-config.yaml (#1528) Arthur Kim 2023-09-19 08:15:33 +09:00
  • 21010ef454 fix doc of TextDecoder (#1526) sqhao 2023-09-19 07:09:59 +08:00
  • 29b7df6231 Update model-card.md (#1643) Nino Risteski 2023-09-19 00:59:49 +02:00
  • e8622f9afc word timing tweaks (#1559) taylorchu 2023-08-07 14:48:56 -07:00
  • b91c907694 Avoid rearranging all caches (#1483) WangChou Lu 2023-07-07 03:48:08 +08:00
  • f572f2161b Improve timestamp heuristics. (#1461) ryanheise 2023-06-30 09:51:24 +10:00
  • 248b6cb124 fix condition_on_previous_text (#1224) Valentin Berkes 2023-05-05 09:31:35 +02:00
  • 7ca9fbea86 Fix numba depreceation notice (#1233) Paul Willot 2023-05-05 08:48:06 +02:00
  • b1c0815c79 Updated README.md to provide more insight on BLEU and specific appendices (#1236) Brett Balquist 2023-05-05 01:47:45 -05:00
  • e334ff141d Avoid computing higher temperatures on no_speech segments (#1279) Théo BOYER 2023-05-05 02:02:36 +02:00
  • 5523722842 Dropped unused execute bit from mel_filters.npz. (#1254) petterreinholdtsen 2023-05-04 19:58:56 +02:00
  • 8035e9ef48 Drop ffmpeg-python dependency and call ffmpeg directly. (#1242) petterreinholdtsen 2023-05-04 19:53:59 +02:00
  • e69930cb9c Python 3.11 (#1171) Johnny 2023-05-04 19:42:09 +02:00
  • c09a7ae299 Update decoding.py (#1219) Jong Wook Kim 2023-04-11 18:13:13 -04:00
  • b0022b3283 Update decoding.py (#1155) Fernando O. Gallego 2023-04-12 00:06:03 +02:00
  • 76c901ab8d Update README.md to reference tiktoken (#1105) Arseniy Bushyn 2023-04-11 03:39:17 +03:00
  • 43940fc978 Implement max line width and max line count, and make word highlighting optional (#1184) ryanheise 2023-04-11 10:28:35 +10:00
  • 255887f219 Squash long words at window and sentence boundaries. (#1114) ryanheise 2023-04-11 10:23:53 +10:00
  • a151816b6b python-publish.yml: bump actions version to fix node warning (#1211) K.B.Dharun Krishna 2023-04-11 02:24:09 +05:30
  • b5851c6c40 Update tokenizer.py (#1163) Jong Wook Kim 2023-03-29 16:12:36 -04:00
  • 6dea21fd7f Release 20230314 Jong Wook Kim 2023-03-15 00:39:05 -07:00
  • 79c43e4859 abort find_alignment on empty input (#1090) Jong Wook Kim 2023-03-14 15:47:58 -04:00
  • 5f9ac653b7 Fix truncated words list when the replacement character is decoded (#1089) Guillaume Klein 2023-03-14 17:32:41 +01:00
  • ba88b8e1b3 fix github language stats getting dominated by jupyter notebook (#1076) Akash Mahajan 2023-03-14 00:07:09 -07:00
  • 671ac5a4ce Fix alignment between the segments and the list of words (#1087) Guillaume Klein 2023-03-14 00:34:09 +01:00
  • 839639a223 Use tiktoken (#1044) Jong Wook Kim 2023-03-13 05:34:16 -04:00
  • ad3250a846 Release 20230308 Jong Wook Kim 2023-03-08 15:48:57 -08:00
  • c4b50c0824 kwargs in decode() for convenience (#1061) Jong Wook Kim 2023-03-08 18:46:38 -05:00
  • 38f2f4d99d fix all_tokens handling that caused more repetitions and discrepancy in JSON (#1060) Jong Wook Kim 2023-03-08 18:34:07 -05:00
  • aac47c9834 fix typo Jong Wook Kim 2023-03-07 20:43:49 -08:00
  • 26807ec6d3 Release 20230307 Jong Wook Kim 2023-03-07 20:36:29 -08:00
  • 919a713499 attempt to fix the repetition/hallucination issue identified in #1046 (#1052) Jong Wook Kim 2023-03-07 23:08:45 -05:00
  • 38e990d853 Use triton==2.0.0 (#1053) Jong Wook Kim 2023-03-07 19:56:31 -05:00
  • 924e1f8e06 Try installing triton only if linux & x86_64 (#1051) Jong Wook Kim 2023-03-07 14:31:40 -05:00
  • 4b0d5e58d0 Update setup.py Jong Wook Kim 2023-03-07 04:47:46 -08:00
  • 8180fde939 Release 20230306 Jong Wook Kim 2023-03-06 18:50:41 -08:00
  • c6e4e5efb3 remove auxiliary audio extension (#1021) Local State 2023-03-06 20:48:14 -05:00
  • b80bcf610d apply formatting with black (#1038) Jong Wook Kim 2023-03-06 18:50:37 -05:00
  • 500d0fe966 word-level timestamps in transcribe() (#869) Jong Wook Kim 2023-03-06 17:00:49 -05:00
  • eab8d920ed Decoding improvements (#1033) Jong Wook Kim 2023-03-06 14:32:32 -05:00
  • 3e1780fd37 Update README.md (#894) Roman Vasilenko 2023-03-03 19:41:59 -05:00
  • 7858aa9c08 Fix infinite loop caused by incorrect timestamp tokens prediction (#914) Andrey Chernykh 2023-02-02 06:46:51 +07:00
  • 5c1a8c10e7 clarify that 3.11 is not supported Jong Wook Kim 2023-01-27 00:01:49 -08:00
  • 4e635c6644 Update README.md about Python 3.8+ requirement Jong Wook Kim 2023-01-24 14:45:56 -08:00
  • a6b36ede1f drop python 3.7 support (#889) Jong Wook Kim 2023-01-24 14:05:57 -08:00
  • 55f690af79 Release 20230124 Jong Wook Kim 2023-01-24 11:11:08 -08:00
  • 7f1ef223ab handle printing even if sys.stdout.buffer is not available (#887) Jong Wook Kim 2023-01-24 10:12:04 -08:00
  • f5bfe004ec Add TSV formatted output in transcript, using integer start/end times in milliseconds. (#228) Niels Mayer 2023-01-22 00:27:17 -08:00
  • da600abd2b Added --output_format option (#333) Aaryan YVS 2023-01-22 13:28:38 +05:30
  • 9f7aba6099 Handle XDG_CACHE_HOME properly for download_root (#864) zer0-x 2023-01-21 12:09:39 +03:00
  • 12e1089462 use stdout for printing transcription progress (#867) Jong Wook Kim 2023-01-20 00:54:05 -08:00
  • ea1c266709 Fix bug where mm is mistakenly replaced with hmm in e.g. 20mm (#659) Markus Hennerbichler 2023-01-18 18:41:11 +00:00
  • 8135a7c31c verbose outputs from pytest Jong Wook Kim 2023-01-18 10:30:18 -08:00
  • 9d646db9d8 print '?' if a letter can't be encoded using the system default encoding (#859) Jong Wook Kim 2023-01-17 23:28:36 -08:00
  • 37a4f1be6d Release 20230117 Jong Wook Kim 2023-01-17 16:08:28 -08:00