Files
supertonic-3-mlx/bench_results.csv
transcrilive 12dbf4a821 v0.1.0 — initial release
MLX-native port of Supertone's Supertonic 3 multilingual TTS. Runs the
full flow-matching + classifier-free-guidance pipeline at ~x100 realtime
on Apple Silicon, with audio cosine 1.0 vs the cached MLX path and
cosine 0.98 vs the upstream ONNX Runtime reference.

Weights are hosted at https://huggingface.co/ambassadia/supertonic-3-mlx
and auto-downloaded on first use; this repository ships the port code,
the model card, audio samples, and a zero-config setup_and_test.sh.

Install:
    pip install git+https://gitea.tavportal.com/olivier/supertonic-3-mlx.git

Quick test:
    git clone https://gitea.tavportal.com/olivier/supertonic-3-mlx.git
    cd supertonic-3-mlx && ./setup_and_test.sh

Licenses (dual): model weights = BigScience Open RAIL-M (Section 4
propagation), port code = Apache-2.0. See LICENSE, LICENSE-CODE, NOTICE.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-20 09:17:05 +02:00

1.1 KiB

1filenamelanguagevoicetextduration_smlx_ms_medianrtf_mlxonnx_ms_medianrtf_onnxspeedup_mlx_over_onnx
2samples/en_F1_short.wavenF1Hello world from Apple Silicon. Supertonic 3 runs at one hundred times real time.2.78636.676.21004.72.827.5
3samples/en_M1_long.wavenM1A gentle breeze moved through the open window while the children, still half-asleep, listened to the distant sound of the harbour bells.3.90138.4101.71356.02.935.3
4samples/fr_F2.wavfrF2Bonjour, ceci est un test de synthèse vocale en français. Le modèle gère trente-et-une langues sur une puce M4.3.41337.990.11195.62.931.6
5samples/de_M2.wavdeM2Guten Morgen. Dieses Modell läuft komplett auf Apple Silicon, ohne ONNX und ohne CoreML, in reinem MLX.3.69238.196.91313.92.834.5
6samples/ja_F3.wavjaF3こんにちは。これはアップルシリコン上でMLXを使ったテストです。1.46332.145.6848.41.726.4
7samples/es_M3.wavesM3Hola, esto es una prueba de síntesis de voz en español ejecutada en tiempo real sobre Apple Silicon.2.85637.077.21002.12.927.1