Researchers at MIT have developed a new system that combines voice and object recognition capable of identifying an object within an image given only a spoken description of that image. When provided with an image and audio caption, the system is able to highlight the relevant regions of the image being described, in real-time. Current … continue reading
Google’s machine learning, speech synthesis technology is now officially available. Google Cloud Text-to-Speech was originally published by DeepMind about two years ago and updated about a year ago with WaveNet research, and announced by Google in March. The technology allows developers to “synthesize natural-sounding speech with 30 voices, available in multiple languages and variants,” the … continue reading
Mozilla announced a mission to help developers create speech-to-text applications earlier this year by making voice recognition and deep learning algorithms available to everyone. Today, the company’s machine learning group is one step closer to completing that mission with the initial open source release of its speech recognition model and voice dataset. “There are only … continue reading
When the Windows 10 Fall Creators Update ships later this fall, Windows Subsystem for Linux (WSL) will leave beta. The company announced WSL will be a fully supported Windows feature in the upcoming update. “This will be great news for those who’ve held-back from employing WSL as a mainline toolset: You’ll now be able to … continue reading
Facebook CEO Mark Zuckerberg started 2016 off with one goal: to build an artificial intelligence personal assistant. The AI is known as Jarvis, or Just a Rather Very Intelligent System. Now that 2016 is coming to an end, Zuckerberg is sharing what he has accomplished so far. Today, he has built a simple AI designed … continue reading
Microsoft is making waves with its artificial intelligence research by developing a piece of technology that recognizes words as well as humans do. According to a team of researchers from the company’s AI and Research division, the company’s speech recognition system has a word error rate of 5.9%, which is about equivalent to a human. … continue reading
Progress has announced the latest release of its UI toolbox for web, mobile and desktop development. The latest version of Telerik DevCraft features support for Angular 2, ASP.NET Core 1.0 and Universal Windows Platform. Additional features include new CheckBoxList, RadioButtonList and RichTextBox components; new chart types; the ability to convert legacy WinForms apps to modern … continue reading
Google is expanding its Cloud Platform to help developers build intelligent applications. The company has introduced the Cloud Natural Language API and Cloud Speech API into open beta. The Cloud Natural Language API enables developers to perform sentiment analysis, entity recognition and syntax analysis in a number of languages. Sentiment analysis aims to understand the … continue reading
When her son was diagnosed with dyslexia, Ari Fertel started reading with him several times a day. Later, when she had four more children, she found they all had different reading levels, and it became a challenge to read with them all for the same amount of time. Her son was later told he no … continue reading
Stephen Hawking has a shiny new communications system courtesy of Intel and British language technology company SwiftKey, and he’s sharing the tech with the world. (He also may have hinted that artificial intelligence will bring about the apocalypse.) Hawking’s new ACAT (Assistive Context Aware Toolkit) replaces the 20 year-old system he was using (and finding … continue reading