OpenAI.fm: Try the Latest GPT-4o mini TTS to Turn Text into Natural, Fluent Speech

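OpenAI.fm is an interactive demo platform built for developers, where you can try the latest text-to-speech model in the OpenAI API. It offers an intuitive environment for testing and exploring the model's features before applying them to your own projects.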

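This post introduces GPT-4o mini TTS, a text-to-speech model built on GPT-4o mini. GPT-4o mini is itself a fast and capable language model, and GPT-4o mini TTS uses that foundation to convert text into natural, fluent speech. The model accepts up to 2000 input tokens, so written content can be turned into natural-sounding human speech for a range of creative and practical uses. With OpenAI.fm, developers can try GPT-4o mini TTS firsthand and judge its speech generation for themselves.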

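▲ The top of the page lets you pick a voice, with both male and female options. The lower left adjusts tone and pronunciation: several preset examples are provided, and you can also write your own prompt. The lower right holds the script to be spoken. The buttons at the bottom play a preview and download the audio file.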
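The three controls described above (voice, tone and pronunciation prompt, script) correspond to fields you would send when calling the model from code instead of the web page. The following is a minimal sketch, not taken from the article: it assumes the OpenAI speech endpoint `https://api.openai.com/v1/audio/speech`, the model name `gpt-4o-mini-tts`, the voice `coral`, and the `instructions` field. The helper name `build_speech_request` and the placeholder API key are hypothetical. The request is only built here, not sent.

```python
import json
import urllib.request

# Assumed endpoint for OpenAI's text-to-speech API (not stated in the article).
API_URL = "https://api.openai.com/v1/audio/speech"


def build_speech_request(text, voice="coral", instructions=None, api_key="YOUR_API_KEY"):
    """Build (but do not send) a POST request for the speech endpoint.

    text          -> the script to be spoken (lower-right box on OpenAI.fm)
    voice         -> the selected voice (top of the page); "coral" is an assumed default
    instructions  -> the tone and pronunciation prompt (lower-left box)
    """
    payload = {
        "model": "gpt-4o-mini-tts",
        "voice": voice,
        "input": text,
        "response_format": "mp3",
    }
    if instructions:
        payload["instructions"] = instructions
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Sending it would look roughly like this (requires a real API key):
# with urllib.request.urlopen(build_speech_request("Hello!")) as resp:
#     with open("speech.mp3", "wb") as f:
#         f.write(resp.read())
```

Building the request separately from sending it makes it easy to inspect the payload before spending API credits.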

Demo

Tone and pronunciation prompt:

Voice:
Playful yet natural, lively-paced with a bouncy rhythm, featuring clear pauses and bright intonations characteristic of Taiwanese Mandarin speech patterns—reminiscent of a cheerful, friendly Taiwanese VTuber chatting with her audience.

Tone:
Cute, energetic, and sweetly persuasive, radiating friendly excitement that naturally builds curiosity and joy, creating a warm and inviting atmosphere that makes listeners feel like they’re part of a fun, shared moment.

Delivery:
Quick yet distinctly articulated, with expressive pitch variations—gentle rises and playful falls—adding charm and personality, maintaining engagement through a conversational, slightly teasing, and always cheerful style.

Pronunciation:
Crisp and clearly Taiwanese-accented Mandarin, with soft retroflex sounds (zh, ch, sh) and relaxed syllable transitions, emphasizing key action words with a bright, happy tone to encourage response and participation.
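The prompt above is organized into labeled sections (Voice, Tone, Delivery, Pronunciation). If you keep such prompts in code, one way to manage them is to store each section separately and join them into a single string in the same layout. This is only a sketch of that idea; the function name `build_instructions` is hypothetical and not part of any OpenAI library.

```python
def build_instructions(sections):
    """Join labeled sections into one prompt string, one "Label:" block per section.

    `sections` is a dict such as {"Voice": "...", "Tone": "..."}; dicts keep
    insertion order, so the blocks appear in the order they were written.
    Sections whose text is empty are skipped.
    """
    blocks = []
    for label, body in sections.items():
        body = " ".join(body.split())  # collapse stray whitespace and line breaks
        if body:
            blocks.append(f"{label}:\n{body}")
    return "\n\n".join(blocks)


prompt = build_instructions({
    "Voice": "Playful yet natural, lively-paced with a bouncy rhythm.",
    "Tone": "Cute, energetic, and sweetly persuasive.",
})
print(prompt)
```

Keeping the sections separate makes it simple to swap just the tone or just the pronunciation guidance while reusing the rest.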

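Script (translated from the original Traditional Chinese):

GPT-4o mini TTS is a text-to-speech model built on GPT-4o mini, which is itself a fast and powerful language model. You can use it to convert text into natural, fluent speech. The maximum input length is 2000 tokens.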
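Because input is capped at 2000 tokens, a longer script has to be broken into several requests. The sketch below shows one possible approach: pack whole sentences greedily into chunks that stay within a budget. It is an illustration under stated assumptions, not the model's own behavior. The function name `split_for_tts` is hypothetical, and the default counter `len` counts characters as a stand-in, which is not the same as the model's token count; pass a real tokenizer function as `count_tokens` for accurate limits.

```python
import re

# Break after Western sentence punctuation when whitespace follows,
# or directly after full-width CJK punctuation (which has no trailing space).
_SENTENCE_BREAK = re.compile(r"(?<=[.!?])(?=\s)|(?<=[。！？])")


def split_for_tts(text, max_tokens=2000, count_tokens=len):
    """Greedily pack whole sentences into chunks within max_tokens.

    count_tokens defaults to len (a character count), which is only an
    assumption-laden stand-in for a real tokenizer.
    A single sentence longer than the budget is kept whole, not cut.
    """
    pieces = [p for p in _SENTENCE_BREAK.split(text) if p]
    chunks, current = [], ""
    for piece in pieces:
        if current and count_tokens(current + piece) > max_tokens:
            chunks.append(current.strip())
            current = piece
        else:
            current += piece
    if current.strip():
        chunks.append(current.strip())
    return chunks


print(split_for_tts("One. Two. Three.", max_tokens=10))
```

Each resulting chunk can then be sent as its own request and the audio files played or concatenated in order.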

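萌芽站長 (Mnya webmaster)

Hello, I am the webmaster and founder of the Mnya Series Website (萌芽系列網站); you can call me 萌芽站長. My interests and skills include hiking, observing terrain, photography, travel, web design, site setup and operation, animation, image processing, and data compilation. If you have questions or suggestions, please post them on the Mnya forum (萌芽論壇). Contact details for site business and commercial partnerships are under "About this site → Team → Webmaster". Nice to meet you!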
