Synthalingua Version 1.2.5 Change Log #202
cyberofficial
announced in
Announcements
Synthalingua Version 1.2.5 Change Log
Description of Changes
This update introduces a major new feature: an intelligent adaptive batch processing system that dynamically allocates transcription jobs between the GPU and CPU for maximum performance. A comprehensive bug report generator and an SRT subtitle fixer have also been added to improve troubleshooting and usability. Additionally, the setup process for vocal isolation has been significantly simplified by replacing Miniconda with a more lightweight, portable Python environment, and the packaged CUDA version has been updated.
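As a rough illustration of what an SRT fixer of this kind has to do, the sketch below reorders cues by start time and renumbers them. It is not the project's actual `--fixsrt` implementation; the function names `to_ms` and `fix_srt` are hypothetical, and it assumes well-formed cue blocks separated by blank lines.

```python
import re

# Matches an SRT timestamp such as 00:01:02,345 (a dot separator is tolerated).
TIMESTAMP = re.compile(r"(\d+):(\d{2}):(\d{2})[,.](\d{3})")


def to_ms(stamp: str) -> int:
    """Convert an SRT timestamp to milliseconds (hypothetical helper)."""
    match = TIMESTAMP.fullmatch(stamp.strip())
    if match is None:
        raise ValueError(f"not an SRT timestamp: {stamp!r}")
    hours, minutes, seconds, millis = map(int, match.groups())
    return ((hours * 60 + minutes) * 60 + seconds) * 1000 + millis


def fix_srt(text: str) -> str:
    """Sort cues by start time and renumber them from 1 (illustrative only)."""
    cues = []
    for block in re.split(r"\n\s*\n", text.strip()):
        lines = block.splitlines()
        for i, line in enumerate(lines):
            if "-->" in line:
                start = line.split("-->")[0]
                # Keep the timing line and everything after it (the cue text).
                cues.append((to_ms(start), line.strip(), lines[i + 1:]))
                break
    # sorted() is stable, so cues with equal start times keep their order.
    ordered = sorted(cues, key=lambda cue: cue[0])
    blocks = [
        "\n".join([str(number), timing, *body])
        for number, (_, timing, body) in enumerate(ordered, start=1)
    ]
    return "\n\n".join(blocks) + "\n"
```

Blocks without a timing line are dropped in this sketch; a real tool would more likely report them.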
Noticeable Changes
- The `--adaptive_batch` mode for caption generation intelligently distributes work between your GPU and CPU. It learns your system's performance and allocates longer tasks to the GPU and shorter ones to the CPU, significantly speeding up processing for large files.
- The `--bugreport` command has been added. It generates a detailed `bugreportinfo.txt` file with system, Python, and hardware information to make troubleshooting and reporting issues much easier.
- The `--fixsrt` utility has been added to repair SRT files with out-of-order timestamps, ensuring subtitles are always displayed chronologically.
- The `--using_vocal_isolation` feature no longer uses Miniconda. It now sets up a portable, embedded Python environment, reducing the installation footprint, complexity, and potential for environment conflicts.
- The `set_up_env.py` script is now fully cross-platform, supporting Windows, Linux, and macOS. Linux users are given a choice between using their system Python or the new embedded Python for vocal isolation.
- The `--about` command now displays a modern, animated interface with project details, features, and contributor acknowledgements.
- The `--https` command-line argument allows running a secure web server alongside the existing HTTP one.
- The `player.html` interface has been completely overhauled with a modern design, a two-column layout for settings, and the ability to change the video source directly from the UI. If no video source is provided in the URL, the page now interactively prompts the user for one.
- The `--list_microphones` command now provides a much more detailed and user-friendly table with device ID, channels, and sample rate, along with clear instructions to use the ID for selection.
- The `remote_microphone.py` script now supports binding the server to a specific IP address using the `--serverip` argument, allowing for network access beyond localhost.
Hidden Changes
- The `modules/adaptive_batch.py` module has been added to handle the logic for GPU/CPU detection, performance tracking, and dynamic job scheduling.
- The `demucs_path_helper.py` script has been rewritten to be more intelligent, actively searching multiple common locations (embedded Python, virtual environments, system paths) to find a valid `demucs` installation.
- In `--batchmode`, if a transcription job times out, the system now interactively prompts the user to either retry with an increased timeout or skip the problematic segment.
- The `build.bat` script has been simplified to directly use `pyinstaller` with the corresponding `.spec` files, streamlining the build process.
- … (`[ORIGINAL]`, `[TRANSLATION]`) for better clarity.
Technical Changes
- `modules/version_checker.py` has been updated to 1.2.5.
- `modules/parser_args.py` was updated to include the new `--bugreport`, `--fixsrt`, `--https`, `--adaptive_batch`, `--batchjobsize`, `--cpu_batches`, `--max_cpu_time`, and `--stop_cpu_at` arguments.
- Added `modules/adaptive_batch.py` for the new processing scheduler, `modules/bug_report.py` for generating system reports, and `modules/srt_fix.py` for the subtitle repair utility.
- `set_up_env.py` Overhaul: The script was rewritten to manage a portable, embedded Python installation instead of Miniconda, handle platform-specific downloads, and generate a `bugreportinfo.txt` file upon completion.
- The `set_up_env.py` script now sets environment variables (`TORIO_USE_FFMPEG=0`) to force `torchaudio` to use the `soundfile` backend, preventing conflicts with FFmpeg/torchcodec when running `demucs`.
- The setup scripts (`setup.bat`, `setup.sh`) have been updated to use the PyTorch index URL for CUDA 12.9 (`cu129`) instead of 12.8.
- The `transcribe_worker.py` script now accepts a `--debug` flag for more verbose logging and has improved UTF-8 handling for stability.
- The `synthalingua.spec` file now includes `torchcodec` in its data collection and changes the final output directory name to `release`.
- The `stop_queue_processing` method in `modules/transcription_core.py` was improved to be more robust and prevent race conditions on shutdown.
- … `misc/` directory, and a new `plan.md` file has been added, outlining the design of the adaptive batching system.
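
The `TORIO_USE_FFMPEG=0` change listed above amounts to passing a modified environment to the child process that runs `demucs`. The sketch below shows that general pattern, not the code in `set_up_env.py`; the helper names `demucs_env` and `run_with_env` are hypothetical, and the command to run is left to the caller.

```python
import os
import subprocess


def demucs_env(base=None):
    """Return a copy of the environment with TORIO_USE_FFMPEG disabled.

    Hypothetical helper: copies the mapping so the caller's environment
    is left untouched.
    """
    env = dict(os.environ if base is None else base)
    env["TORIO_USE_FFMPEG"] = "0"
    return env


def run_with_env(command):
    """Run a command (a list of arguments) with the adjusted environment."""
    return subprocess.run(
        command,
        env=demucs_env(),
        capture_output=True,
        text=True,
        check=True,
    )
```

Setting the variable only in the child's environment keeps the change scoped to that one process rather than altering the whole session.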