Speech Recognition & Synthesis

From Wikipedia, the free encyclopedia

Developer(s): Google
Initial release: November 13, 2013; 10 years ago
Stable release: 20231225.02_p0.593665078 (Android 8-14) / December 25, 2023; 35 days ago[1]
Operating system: Android
Type: Screen reader

Speech Recognition & Synthesis, formerly known as Speech Services,[2] is a screen reader application developed by Google for its Android operating system. It enables applications to read the text on the screen aloud, with support for many languages. Its Text-to-Speech feature may be used by apps such as Google Play Books to read books aloud, by Google Translate to read translations aloud so that users can hear how words are pronounced, by Google TalkBack and other accessibility applications that provide spoken feedback, and by third-party apps. Users must install voice data for each language.

Supported languages

  • Albanian (Albania)
  • Arabic
  • Bengali (Bangladesh)
  • Bengali (India)
  • Bosnian (Bosnia and Herzegovina)
  • Bulgarian (Bulgaria)
  • Cantonese (Hong Kong)
  • Catalan (Spain)
  • Chinese (China)
  • Chinese (Taiwan)
  • Croatian (Croatia)
  • Czech (Czech Republic)
  • Danish (Denmark)
  • Dutch (Belgium)
  • Dutch (Netherlands)
  • English (Australia)
  • English (Nigeria)
  • English (India)
  • English (United Kingdom)
  • English (United States)
  • Estonian (Estonia)
  • Filipino (Philippines)
  • Finnish (Finland)
  • French (Canada)
  • French (France)
  • German (Germany)
  • Greek (Greece)
  • Gujarati (India)
  • Hebrew (Israel)
  • Hindi (India)
  • Hungarian (Hungary)
  • Icelandic (Iceland)
  • Indonesian (Indonesia)
  • Italian (Italy)
  • Japanese (Japan)
  • Javanese (Indonesia)
  • Kannada (India)
  • Khmer (Cambodia)
  • Korean (South Korea)
  • Latvian (Latvia)
  • Lithuanian (Lithuania)
  • Malay (Malaysia)
  • Malayalam (India)
  • Marathi (India)
  • Nepali (Nepal)
  • Norwegian Bokmål (Norway)
  • Polish (Poland)
  • Portuguese (Brazil)
  • Portuguese (Portugal)
  • Punjabi (India)
  • Romanian (Romania)
  • Russian (Russia)
  • Sinhala (Sri Lanka)
  • Slovak (Slovakia)
  • Spanish (Spain)
  • Spanish (United States)
  • Sundanese (Indonesia)
  • Swahili (Kenya)
  • Swedish (Sweden)
  • Tamil (India)
  • Telugu (India)
  • Thai (Thailand)
  • Turkish (Turkey)
  • Ukrainian (Ukraine)
  • Urdu (Pakistan)
  • Vietnamese (Vietnam)
  • Welsh (United Kingdom)

History

Some app developers, such as Hyundai in 2015, have adapted their Android Auto apps to include Text-to-Speech.[3] Apps such as textPlus and WhatsApp use Text-to-Speech to read notifications aloud and provide voice-reply functionality.

Google Cloud Text-to-Speech is powered by WaveNet,[4] software created by Google's UK-based AI subsidiary DeepMind, which Google acquired in 2014.[5] The service tries to distinguish itself from those of its competitors, Amazon and Microsoft.[6]

DeepMind's voice synthesis technology is notably advanced and produces realistic speech. Most voice synthesizers (including Apple's Siri) use concatenative synthesis,[4] in which a program stores individual phonemes and then pieces them together to form words and sentences.
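The concatenative approach described above can be illustrated with a minimal Python sketch. It is a toy under stated assumptions, not the method of any real synthesizer: the unit inventory `UNIT_DB`, the phoneme labels, the sample rate, and the sine tones standing in for recorded speech units are all hypothetical.

```python
import math

SAMPLE_RATE = 8000  # Hz; arbitrary value chosen for this toy example


def tone(freq_hz, duration_s):
    """Return a sine tone as a list of floats (stand-in for a recorded unit)."""
    n = round(SAMPLE_RATE * duration_s)
    return [math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE) for i in range(n)]


# Hypothetical unit inventory: each phoneme label maps to a stored waveform.
# A real system would store recordings of human speech instead of tones.
UNIT_DB = {
    "HH": tone(200, 0.05),
    "AH": tone(300, 0.10),
    "L": tone(250, 0.05),
    "OW": tone(350, 0.10),
}


def concatenative_synthesis(phonemes):
    """Look up each stored unit and join the waveforms end to end."""
    waveform = []
    for label in phonemes:
        waveform.extend(UNIT_DB[label])
    return waveform


audio = concatenative_synthesis(["HH", "AH", "L", "OW"])
print(len(audio), "samples")
```

Because the units are simply placed end to end, nothing smooths the boundaries between them, which is one reason output of this kind can sound less natural than a waveform generated as a whole.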

WaveNet generates speech that sounds more natural than other text-to-speech systems. It synthesizes speech with more human-like emphasis and inflection on syllables, phonemes, and words. On average, a WaveNet produces speech audio that people prefer over other text-to-speech technologies. Unlike most other text-to-speech systems, a WaveNet model creates raw audio waveforms from scratch. The model uses a neural network that has been trained using a large volume of speech samples. During training, the network extracts the underlying structure of the speech, such as which tones follow each other and what a realistic speech waveform looks like. When given a text input, the trained WaveNet model can generate the corresponding speech waveforms from scratch, one sample at a time, with up to 24,000 samples per second and smooth transitions between the individual sounds.[4]
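Generating a waveform "one sample at a time" means that each new sample is computed from the samples already produced. The following Python sketch shows only that loop structure, at the quoted rate of 24,000 samples per second. It is not WaveNet: the function `toy_next_sample` is a hypothetical stand-in that uses a fixed two-term recurrence (which happens to produce a sine wave), whereas the real model is a trained neural network conditioned on the input text.

```python
import math

SAMPLE_RATE = 24_000  # samples per second, the upper figure quoted above
FREQ_HZ = 440.0       # arbitrary pitch for the toy signal (an assumption)

# Coefficient of the recurrence x[n] = c * x[n-1] - x[n-2], which
# reproduces a sine wave of FREQ_HZ when seeded with two sine samples.
_COEFF = 2 * math.cos(2 * math.pi * FREQ_HZ / SAMPLE_RATE)


def toy_next_sample(history):
    """Hypothetical stand-in for the trained network.

    Predicts the next sample from the two most recent samples only.
    """
    return _COEFF * history[-1] - history[-2]


def generate(duration_s):
    """Build a waveform sample by sample, feeding outputs back as inputs."""
    n_samples = round(SAMPLE_RATE * duration_s)
    # Seed with the first two samples of the target sine wave.
    samples = [0.0, math.sin(2 * math.pi * FREQ_HZ / SAMPLE_RATE)]
    while len(samples) < n_samples:
        samples.append(toy_next_sample(samples))
    return samples[:n_samples]


audio = generate(0.5)
print(len(audio), "samples for 0.5 s of audio")
```

At this rate, half a second of audio requires 12,000 sequential predictions, which gives a sense of why sample-level generation is computationally demanding.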

The service was renamed Speech Recognition & Synthesis in 2023.[citation needed]

References

  1. "Speech Services by Google APKs". APKMirror.
  2. Wang, Jules (November 8, 2021). "You'll never guess the latest Google app to cross 10 billion installs (seriously)". Android Police. Archived from the original on November 8, 2021. Retrieved November 18, 2021.
  3. "Google, Hyundai show off new third-party Android Auto apps". CNET. CBS Interactive. Retrieved January 17, 2015.
  4. "WaveNet". www.deepmind.com. Retrieved June 22, 2023.
  5. Gibbs, Samuel (January 27, 2014). "Google buys UK artificial intelligence startup Deepmind for £400m". The Guardian. ISSN 0261-3077. Retrieved June 22, 2023.
  6. "Text-to-Speech AI: Lifelike Speech Synthesis". Google Cloud. Retrieved June 22, 2023.

This page was last edited on 18 January 2024, at 03:55
Basis of this page is in Wikipedia. Text is available under the CC BY-SA 3.0 Unported License. Non-text media are available under their specified licenses. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc. WIKI 2 is an independent company and has no affiliation with Wikimedia Foundation.