Speechify.com — TTS and highlighting pronounced text feature

Speechify.com is a service that provides text-to-speech (TTS) technology primarily geared towards educational and productivity purposes. I was looking for the best solution on the highest quality text-to-speech solution and came across this service. I really liked the idea that the service has the functionality to highlight the spoken text in real time. Just check it out:

Speechify.com’s combination of high-quality text-to-speech and real-time highlighting can be particularly beneficial for various use cases, such as improving accessibility for individuals with visual impairments, enhancing comprehension for language learners, or increasing productivity for users who prefer to listen to written content while multitasking.

There is also a very cool presentation video with the creator of this startup —
Cliff Weitzman himself:

Very cool service, but in fact this function can be easily implemented using the API of chrome itself (the easiest way) or by using such services as Yandex TTS, Google Cloud Text-to-Speech, Amazon Polly, or Microsoft Azure Text-to-Speech. Moreover there are a lot of opensource services:

  1. eSpeak: eSpeak is a compact, open-source TTS engine that supports multiple languages and platforms. It’s designed to be lightweight and can be used in various applications, including embedded systems.
  2. Festival and Flite (Festival Lite): Festival is a general-purpose TTS system developed by the University of Edinburgh. It offers support for multiple languages and provides tools for building custom voices. Flite is a smaller and faster version of the Festival TTS system. It’s designed for resource-constrained environments and can be easily integrated into embedded systems and mobile applications.
  3. MaryTTS: MaryTTS is an open-source TTS platform that aims to provide high-quality, multilingual speech synthesis. It supports various languages and features customizable voice models.
  4. Mozilla TTS: Mozilla TTS is a deep learning-based TTS system developed by Mozilla. It uses neural network models to generate speech and offers high-quality, natural-sounding output.
  5. OpenTTS: OpenTTS is an open-source TTS engine developed by OpenAI. It’s based on cutting-edge research in deep learning and offers high-quality, expressive speech synthesis.

So, there is a lot of pre-ready-made solutions.

And now goes the question:
How to make Text-to-Speech with Word Highlighting in any web app?

Well, it is quite easy. I sketched out a simple demo example based on the Web Speech API.
And, by the way, such tasks are easily can be solved by using ChatGPT, Gemini or another AI-agent, just send the prompt message:

Create a CodePen project that implements a simple Text-to-Speech (TTS) functionality with word highlighting. The application should have the following features:

  1. An input area where users can input the text they want to be read aloud.
  2. A button to initiate the TTS process.
  3. As the text is being read aloud, each word should be highlighted.
  4. The highlighting should move along with the spoken word.
  5. The application should use the Web Speech API for TTS functionality.
  6. Implement error handling for speech synthesis errors.

Tasks:

  1. Set up a new code CodePen project.
  2. Create an HTML structure with an input area, a button, and a container for the highlighted text.
  3. Write JavaScript code to handle the TTS functionality.
  4. Use the Web Speech API to synthesize speech from the input text.
  5. Implement word highlighting as the text is being spoken.
  6. Handle errors that may occur during speech synthesis.
  7. Test the application to ensure proper functionality.
  8. Optionally, style the application to make it visually appealing.
  9. Ensure the code is properly documented and organized.

See the Pen TTS with Word Highlighting Demo by Nickolai Yegorov (@e-Nicko) on CodePen.

Pretty cool, right?
It really is that easy these days!

This is a basic example to get started. It is easy to enhance it further by adding more features, such as controlling the speed of the speech (rate property), pausing and resuming speech, or handling more complex text formatting (just as Speechify, it will be cool to make light highlight for whole current sentence and main highlight for the current reading word). Additionally, you can style the highlighted text and its container using CSS to make it visually appealing.


Definitely, this is a great find today!

Surely will use this functionality in my digital products and services! Of course with better TTS voices and more advanced features.

UsingEnglish.com — Comprehensive ESL Tools and Resources

UsingEnglish.com is a comprehensive online platform dedicated to English as a Second Language (ESL) tools and resources. Established in 2002, the site serves students, teachers, learners, and academics across ESL, EFL, ESOL, and EAP subject areas.

For Students:

UsingEnglish.com offers a diverse range of materials specifically designed for ESL learners. Whether you’re aiming to refine your grammar, expand your vocabulary, or practice language skills, our platform has you covered. Our user-friendly site has attracted millions of students each month, all benefitting from our grammar guides, vocabulary lists, exercises, and more. Join them and explore the wealth of resources available to help you improve your English proficiency.

  • Improve English skills with grammar guides, vocabulary lists, and exercises.
  • Explore reference materials covering idioms, phrasal verbs, irregular verbs, and grammar terms.
  • Test language knowledge with quizzes, tests, and reading comprehension exercises.

For Teachers:

UsingEnglish.com is go-to resource which packed with lesson plans, worksheets, handouts, and teacher training materials, our site is tailored to support both new and experienced educators. Explore our teacher resources, engage in the teacher forum, and discover articles and handouts to enhance your teaching skills.

  • Find lesson plans, worksheets, handouts, and teacher training resources.
  • Connect with other educators through the teacher forum.

Tools & Resources:

Access a variety of resources and tools that support your English studies. From text analysis and grammar usage to downloadable e-books and online English training, UsingEnglish.com provides a holistic learning experience.

  • Access resources such as text analysis, grammar and usage guides, downloadable e-books, and online English training.

Oxford Online English — Pronunciation Lessons

Diving into the ‘Mastering English Pronunciation‘ playlist by @Oxford Online English has been a game-changer for my language journey. 🎧 Each lesson feels like a personalized guide, helping me navigate the intricacies of English pronunciation with newfound confidence.

https://www.youtube.com/playlist?list=PLD6t6ckHsrubLp8Ia8duzu5fN4riM2-Bl

From dissecting the subtle dance of syllables and word stress to embracing the challenge of perfecting vowel sounds, this playlist is a treasure trove of insights. The lessons, presented in a friendly and engaging manner, have this incredible way of making complex concepts feel approachable.

What I appreciate most is the diverse range of topics covered. Whether it’s understanding the Schwa sound, unraveling the mysteries of sentence stress, or even exploring the nuances between British and American pronunciation, every lesson adds a layer to my linguistic toolkit.

The best part? It’s not just about mimicking sounds; it’s about truly understanding the ‘why’ behind each pronunciation nuance. The lessons on reading IPA and weak forms, for instance, have been eye-openers, giving me a deeper appreciation for the mechanics of language.

As someone on the journey to articulate thoughts more clearly and sound more like a native speaker, this playlist has become my go-to companion. It’s not just about sounding right; it’s about feeling confident in expressing myself authentically.

toPhonetics.com — Elevating Pronunciation Precision

Today, I stumbled upon a useful online tool that has made my journey into the English pronunciation an absolute delight. The website is called toPhonetics.com, and it has quickly become my go-to companion for deciphering the mysteries of English pronunciation.

Features of toPhonetics.com, an online converter that goes beyond basic phonetic transcription. This powerful tool offers a range of functionalities to enhance understanding of pronunciation:

  1. British and American Pronunciation Options: Choose between British and American pronunciation styles. Experience the nuanced differences, such as the distinct treatment of the [r] sound in British pronunciation.
  2. International Phonetic Alphabet (IPA) Symbols: Accurate and standardized IPA symbols are used in the phonetic transcriptions, providing a comprehensive guide to pronunciation.
  3. Preservation of Text Structure: Maintain the original structure of the text, including line breaks and punctuation marks, in the phonetic transcription output. This feature facilitates easier reading and reference.
  4. Show Weak Forms Option: Explore connected speech nuances with the option to show weak forms of words. Weak forms are italicized in the output, providing a clear distinction.
  5. Acronym Interpretation: Words in CAPS are interpreted as acronyms if not found in the database. Acronym transcriptions are displayed with hyphens between letters.
  6. Extensive Database: Beyond common vocabulary, the database includes a substantial collection of place names, nationalities, and popular names, enriching the range of transcriptions available.
  7. Side-by-Side or Line-by-Line Output: Tailor the display to your preference by choosing to see the text and its phonetic transcription side-by-side or line-by-line. This makes back-referencing to the original text seamless.
  8. Contextual Pronunciation Selection: In cases where a word has multiple pronunciations, select the one that aligns with the context by clicking on it. Hover over the word to see a popup with a list of possible pronunciations.
  9. Regular Database Updates: Stay current with language evolution. The dictionary database is regularly updated, addressing missing words and ensuring a comprehensive resource.
  10. Speech Synthesis Support: Take your learning to the next level by having the text read out loud in browsers with speech synthesis support, such as Safari and Chrome.

lingolia — structured grammar & vocabulary information

english.lingolia.com/en

Quite a good information project: good articles on grammar, there are reading and listening materials for different levels, interactive simulators, level testing.

But… what’s missing? – methodical roadmap for different levels.

It is much more convenient when a strict methodical plan with calculation by hours and taking into account interval repetition to consolidate the material.

Яндекс Переводчик — примеры

Очень удобный сервис для лучшего понимания контекста применения слова. Например, “я думаю” по-английски chatGPT рекомендовал написать как I believe, что в дословном переводе на русский означает “я верю”. Однако, судя по этимологии, по смыслу это ближе как hold something as an opinion, т.е. смысл именно полагать, думать, верить.

В данном случае, забив в Яндекс “I believe”, мы получаем множество наглядных контекстов применения данной лексемы:

Также ещё хорош сервис Google: