Skip to main content

Try | Microsoft Project Oxford Emotion

Try | Microsoft Project Oxford Emotion


Happy? Sad? Angry? This Microsoft tool recognizes emotions in pictures
Humans have traditionally been very good at recognizing emotions on people’s faces, but computers? Not so much.

That is, until now. Recent advances in the fields of machine learning and artificial intelligence are allowing computer scientists to create smarter apps that can identify things like sounds, words, images – and even facial expressions.

The Microsoft Project Oxford team today announced plans to release public beta versions of new tools that help developers take advantage of those capabilities, including one that can recognize emotion. Chris Bishop, head of Microsoft Research Cambridge in the United Kingdom, showed off the emotion tool earlier today in a keynote talk at Future Decoded, a Microsoft conference on the future of business and technology.

The tools, many of which are used in Microsoft’s own products, are designed for developers who don’t necessarily have machine learning or artificial intelligence expertise but want to include capabilities like speech, vision and language understanding in their apps.

Microsoft released the first set of Microsoft Project Oxford tools last spring, and the project’s leaders say they quickly drew the interest of everyone from well-known Fortune 500 companies to small, scrappy startups who are eager for these capabilities but don’t have a team of machine learning and AI experts in their ranks.

“The exciting thing has been how much interest there is and how diverse the response is,” said Ryan Galgon, a senior program manager within Microsoft’s Technology and Research group.

Emotions, video, spell check and facial hair
Utilizing machine learning, these types of systems get smarter as they receive more data; the technology is the basis for major breakthroughs including Skype Translator’s real-time translation and Microsoft’s Cortana personal assistant.

In the case of something like facial recognition, the system can learn to recognize certain traits from a training set of pictures it receives, and then it can apply that information to identify facial features in new pictures it sees.

The emotion tool released today can be used to create systems that recognize eight core emotional states – anger, contempt, fear, disgust, happiness, neutral, sadness or surprise – based on universal facial expressions that reflect those feelings.

Galgon said developers might want to use these tools to create systems that marketers can use to gauge people’s reaction to a store display, movie or food. Or, they might find them valuable for creating a consumer tool, such as a messaging app, that offers up different options based on what emotion it recognizes in a photo.

The facial recognition technology that is part of Microsoft Project Oxford also can be used in plenty of other ways, such as grouping collections of photos based on the faces of people that appear in them.

Or, it can be used for more entertaining purposes. Earlier this week, in honor of the facial hair fundraising effort Movember, Microsoft released MyMoustache, which uses the technology to recognize and rate facial hair.

The emotion tool is available to developers as a public beta beginning today. In addition, Microsoft is releasing public beta versions of several other new tools by the end of the year. The tools are available for a limited free trial.

They include:

Spell check: This spell check tool, which developers can add to their mobile- or cloud-based apps and other products, recognizes slang words such as “gonna,” as well as brand names, common name errors and difficult-to-spot errors such as “four” and “for.” It also adds new brand names and expressions as they are coined and become popular. It’s available as a public beta beginning today.

Video: This tool lets customers easily analyze and automatically edit videos by doing things like tracking faces, detecting motion and stabilizing shaky video. It’s based on some of the same technology found in Microsoft Hyperlapse. It will be available in beta by the end of the year.

Speaker recognition: This tool can be used to recognize who is speaking based on learning the particulars of an individual’s voice. A developer could use it as a security measure since a person’s voice, like a fingerprint, is unique. It will be available as a public beta by the end of the year.

Custom Recognition Intelligent Services: This tool, also known as CRIS, makes it easier for people to customize speech recognition for challenging environments, such as a noisy public space. For example, a company could use it to help a team better use speech recognition tools while working on a loud shop floor or busy shopping center. It also could be used to help an app better understand people who have traditionally had trouble with voice recognition, such as non-native speakers or those with disabilities. It will be available as an invite-only beta by the end of the year.

Updates to face APIs: In addition to the new tools, Microsoft Project Oxford’s existing face detection tool will be updated to include facial hair and smile prediction tools, and the tool also has improved visual age estimation and gender identification.

Developers who are interested in these tools can find out more about them and give them a try by visiting the Microsoft Project Oxford website.








Comments

Popular posts from this blog

JDownloader

  JDownloader JDownloader is a free, open-source download management tool with a huge community that makes downloading as easy and fast as it should be. Users can start, stop or pause downloads, set bandwith limitations, auto-extract archives and much more. It's an easy-to-extend framework that can save hours of your valuable time every day!

Microsoft Process Explorer 17.00

 Microsoft Process Explorer 17.00 Ever wondered which program has a particular file or directory open? Now you can find out. Process Explorer shows you information about which handles and DLLs processes have opened or loaded. The Process Explorer display consists of two sub-windows. The top window always shows a list of the currently active processes, including the names of their owning accounts, whereas the information displayed in the bottom window depends on the mode that Process Explorer is in: if it is in handle mode you'll see the handles that the process selected in the top window has opened; if Process Explorer is in DLL mode you'll see the DLLs and memory-mapped files that the process has loaded. Process Explorer also has a powerful search capability that will quickly show you which processes have particular handles opened or DLLs loaded. The unique capabilities of Process Explorer make it useful for tracking down DLL-version problems or handle leaks, and provide insig...

OBS Studio 28.1

OBS Studio 28.1 Free and open source software for video recording and live streaming. Download and start streaming quickly and easily on Windows, Mac or Linux. OBS Studio (Open Broadcaster) is a free and open source software for video recording and live streaming. It features high-performance real-time video/audio capturing and mixing with unlimited scenes you can switch seamlessly via custom transitions. Filters for video sources are available, including image masking, color correction, chroma/color keying, an intuitive audio mixer with filter functions such as noise gate, noise suppression, and gain can improve the quality. OBS Studio is equipped with a powerful API, enabling plugins and scripts to provide further customization and functionality specific to your needs. Utilize native plugins for high performance integrations or scripts written with Lua or Python that interface with existing sources. Work with developers in the streaming community to get the features you need with end...