How to use iPhone’s awesome new text-to-speech feature

Your iPhone has a text-to-speech feature built-in. You don’t need to download an app.
Image: D. Griffin Jones/Cult of Mac

In iOS 17, the iPhone got a built-in text-to-speech feature called Live Speech. You can even use Live Speech with a digital version of your own voice called Personal Voice.

Apple devised Personal Voice for users “at risk of losing their ability to speak — such as those with a recent diagnosis of ALS (amyotrophic lateral sclerosis) or other conditions that can progressively impact speaking ability.” It was the subject of a touching and heartfelt video Apple made called “The Lost Voice.”

Here’s how to set up and use it.

How to use the iPhone’s new text-to-speech feature

Time needed: 30 minutes

How to set up Live Speech and create your Personal Voice

  1. Update to iOS 17

    Live Speech and Personal Voice are both included in iOS 17, released in September 2023. If you haven’t updated your phone in a long time, go to Settings > General > Software Update to make sure you have it.

  2. Turn on Live Speech in Accessibility settings

    Go to Settings > Accessibility > Live Speech (toward the bottom of the page). Turn on the Live Speech toggle at the top.

  3. Triple-click the iPhone side button for text-to-speech

    To use Live Speech, triple-click the side button of your iPhone. (On an older iPhone, triple-click the Home button.) If a menu of options appears, select Live Speech.
    A keyboard will appear. Type whatever you want to say and hit Send. Your iPhone will play what you entered over the speakers, highlighting each word as it speaks.

  4. Add favorite phrases for easy access

    In Settings > Accessibility > Live Speech, tap Favorite Phrases. Tap the + icon at the top to add a new one. The phrases you add here will be easier to select; you won’t have to type them out every time.
    For example, if you have a pet, it might be convenient to add phrases like “Scout, home here,” “Indy, lay down” or “Wookiee, stop eating my salad” for instant access.
    To access your favorite phrases, activate Live Speech and tap Phrases in the popup menu. Tap any one of these to play it.

  5. Record your Personal Voice

    In Live Speech settings, you can choose a voice from any of the Siri (or classic Mac OS) voices.
    But the real killer feature is creating a Personal Voice. This lets you digitize your voice so that iPhone text-to-speech sounds like it’s really coming from you.
    Go back and tap Personal Voice > Create a Personal Voice. Find a small, quiet room where you can speak uninterrupted for around 15 to 60 minutes.
    Just follow the prompts on screen and read them aloud.

  6. Wait

    Plug in your iPhone and let it process. It needs to turn all your recordings into a dynamic digital model of your voice. It could take an hour or more.

  7. Set Live Speech to use your Personal Voice

    Go back to Settings > Accessibility > Live Speech and you should see a new option to use your Personal Voice instead of the other canned voices. Tap the Play button to hear a quick preview; tap the name to set it. 
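
For the curious, the word-by-word highlighting described in step 3 can be illustrated with a short Python sketch. This is not Apple’s implementation; the function names word_spans and highlight are made up for illustration, and the sketch only assumes that a speech engine announces which word it is about to speak.

```python
import re


def word_spans(text):
    """Return (start, end) character offsets for each whitespace-delimited word."""
    return [(m.start(), m.end()) for m in re.finditer(r"\S+", text)]


def highlight(text, span):
    """Return the text with the word at the given span wrapped in brackets."""
    start, end = span
    return f"{text[:start]}[{text[start:end]}]{text[end:]}"


# Hypothetical usage: one "frame" per word, in speaking order.
phrase = "Indy, lay down"
frames = [highlight(phrase, span) for span in word_spans(phrase)]
for frame in frames:
    print(frame)
```

Each printed line marks the word being spoken at that moment, which is roughly what you see on screen while Live Speech reads your text aloud.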



Amphion open source Text-to-Speech (TTS) AI model

If you’re venturing into the world of audio, music, and speech generation, a new open-source AI Text-to-Speech (TTS) toolkit called Amphion is worth a look. Designed with both seasoned experts and budding researchers in mind, Amphion is a platform for transforming various inputs into audio. Its primary appeal lies in simplifying and demystifying the complex processes of audio generation.

Amphion’s Core Functionality

Amphion isn’t just another toolkit in the market. It’s a comprehensive system that offers:

  • Multiple Generation Tasks: Beyond the traditional Text-to-Speech (TTS) functionality, Amphion extends its capabilities to Singing Voice Synthesis (SVS), Voice Conversion (VC), and more. These features are at various stages of development.
  • Advanced Model Support: The toolkit includes support for a range of state-of-the-art models like FastSpeech2, VITS, and NaturalSpeech2. These models are at the forefront of TTS technology, offering users a variety of options to suit their specific needs.
  • Vocoder and Evaluation Metrics Integration: Vocoder technology is crucial for generating high-quality audio signals. Amphion includes several neural vocoders, among them GAN-based and diffusion-based options. Evaluation metrics are also part of the package, helping to check consistency and quality in generation tasks.

Why Amphion Stands Out

Amphion distinguishes itself through its user-friendly approach. If you’re wondering how this toolkit can benefit you, here’s a glimpse:

  • Visualizations of Classic Models: A unique feature of Amphion is its visualizations, which are especially beneficial for those new to the field. These visual aids provide a clearer understanding of model architectures and processes.
  • Versatility for Different Users: Whether you are setting up locally or integrating with online platforms like Hugging Face spaces, Amphion is adaptable. It comes with comprehensive guides and examples, making it accessible to a wide range of users.
  • Reproducibility in Research: Amphion’s commitment to research reproducibility is clear. It supports classic models and structures while offering visual aids to enhance understanding.

Amphion’s technical aspects

Let’s delve into the more technical aspects of Amphion:

  • Text to Speech (TTS): Amphion excels in TTS, supporting models like FastSpeech2 and VITS, known for their efficiency and quality.
  • Singing Voice Conversion (SVC): SVC is a novel feature, supported by content-based features from models like WeNet and Whisper.
  • Text to Audio (TTA): Amphion’s TTA capability uses a latent diffusion model, offering a sophisticated approach to audio generation.
  • Vocoder Technology: Amphion’s range of vocoders includes GAN-based vocoders like MelGAN and HiFi-GAN, as well as the flow-based WaveGlow and the diffusion-based DiffWave.
  • Evaluation Metrics: The toolkit ensures consistent quality in audio generation through its integrated evaluation metrics.
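
To make the idea of an evaluation metric concrete, here is a minimal Python sketch of one generic measure: the signal-to-noise ratio, in decibels, between a reference waveform and a generated one. It is an illustration only. The article does not say which metrics Amphion ships, and snr_db is a hypothetical helper, not part of Amphion’s API.

```python
import math


def snr_db(reference, generated):
    """Signal-to-noise ratio in dB, treating (reference - generated) as noise.

    Hypothetical helper for illustration; not taken from Amphion.
    """
    if len(reference) != len(generated):
        raise ValueError("waveforms must have the same number of samples")
    signal = sum(r * r for r in reference)
    noise = sum((r - g) ** 2 for r, g in zip(reference, generated))
    if noise == 0:
        return math.inf  # the two waveforms are identical
    if signal == 0:
        return -math.inf  # silent reference, non-silent output
    return 10 * math.log10(signal / noise)


# Toy example: the generated audio is the reference at half amplitude.
reference = [1.0, -1.0, 1.0, -1.0]
generated = [0.5, -0.5, 0.5, -0.5]
print(round(snr_db(reference, generated), 2))  # prints 6.02
```

A higher value means the generated audio is closer to the reference. Real toolkits typically rely on perceptual or spectral measures instead, but the workflow is the same: compare the output against a reference and report a number.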

Amphion offers a bridge connecting AI enthusiasts, researchers and sound engineers to the vast and evolving world of AI audio generation. Its ease of use, high-quality audio outputs, and commitment to research reproducibility position it as a valuable asset in the field. Whether you are a novice exploring the realm of TTS or an experienced professional, Amphion offers a comprehensive and user-friendly platform to enhance your work.

The open source Amphion Text-to-Speech AI model demonstrates the power and potential of open-source projects in advancing technology. It’s a testament to the collaborative spirit of the tech community, offering a resource that not only achieves technical excellence but also fosters learning and innovation. So, if you’re looking to embark on or further your journey in audio generation, Amphion is your go-to toolkit. Its blend of advanced features, user-centric design, and commitment to research makes it an indispensable resource in the field.

 
