• Home
  • About Us
  • Contact Us
  • Privacy Policy
  • Special Offers
Business Intelligence Info
  • Business Intelligence
    • BI News and Info
    • Big Data
    • Mobile and Cloud
    • Self-Service BI
  • CRM
    • CRM News and Info
    • InfusionSoft
    • Microsoft Dynamics CRM
    • NetSuite
    • OnContact
    • Salesforce
    • Workbooks
  • Data Mining
    • Pentaho
    • Sisense
    • Tableau
    • TIBCO Spotfire
  • Data Warehousing
    • DWH News and Info
    • IBM DB2
    • Microsoft SQL Server
    • Oracle
    • Teradata
  • Predictive Analytics
    • FICO
    • KNIME
    • Mathematica
    • Matlab
    • Minitab
    • RapidMiner
    • Revolution
    • SAP
    • SAS/SPSS
  • Humor

Microsoft researchers improve AI tech for answering questions about photos

November 30, 2015   Big Data

Microsoft researchers are back at it, using a type of artificial intelligence called deep learning to build smarter software. Building on the company’s previous work, Microsoft Research employees have developed a new method of answering simple questions about the content of a photo more accurately than similar systems that have been demonstrated by other groups recently.

The approach involves two types of artificial neural networks — convolutional and a long short-term memory network — which generally train on large quantities of data, such as photos, and then make inferences about new photos that they are given. But the new method goes further by incorporating a “stacked attention network,” which effectively hones in on the key area in an image in order to answer a specific question. There are multiple layers in this network, and with each one there’s a greater level of zooming in, resulting in more accuracy.

“It’s taking on a human’s attention capability,” Li Deng, partner research manager at Microsoft Research’s Deep Learning Technology Center, said in an interview for a post today on the Next at Microsoft blog. “This is the technology that couldn’t have been imagined a few years ago — modeling human behavior to solve problems.”

The new method, which was documented this month in the paper “Stacked Attention Networks for Image Question Answering,” outperforms academic work that Microsoft researchers published earlier this year in a paper called “VQA: Visual Question Answering.”

Not that Microsoft is the only company investigating the possibilities of mixing natural language processing with image recognition.

The automatic creation of captions for photos has been explored extensively by the likes of Google as well as Microsoft. In the narrow realm of image question answering, Baidu, Huawei, and others have published papers on their progress, while Facebook recently demonstrated a mobile app that can allow blind people to ask spoken questions about what’s in a photo and receive spoken answers in response.

The big achievement here is accuracy. Especially when the system can answer questions with just one word, it performs better than what Baidu and Huawei have shown. Heck, a human being is not a whole lot better at this work. And if you look at the big picture, you know that’s a big deal.

This entry passed through the Full-Text RSS service – if this is your content and you’re reading it on someone else’s site, please read the FAQ at fivefilters.org/content-only/faq.php#publishers.

VentureBeat » Big Data News | VentureBeat

About, answering, Improve, Microsoft, photos, Questions, researchers, TECH
  • Recent Posts

    • MTG
    • TripleBlind raises $8.2 million for its encrypted data science platform
    • Ba’al comes to CPAC, Ted Cruz jokes about his Cancun trip
    • Optimizing data migration/integration with Power Platform
    • AI Weekly: Biden calls for $37 billion to address chip shortage
  • Categories

  • Archives

    • March 2021
    • February 2021
    • January 2021
    • December 2020
    • November 2020
    • October 2020
    • September 2020
    • August 2020
    • July 2020
    • June 2020
    • May 2020
    • April 2020
    • March 2020
    • February 2020
    • January 2020
    • December 2019
    • November 2019
    • October 2019
    • September 2019
    • August 2019
    • July 2019
    • June 2019
    • May 2019
    • April 2019
    • March 2019
    • February 2019
    • January 2019
    • December 2018
    • November 2018
    • October 2018
    • September 2018
    • August 2018
    • July 2018
    • June 2018
    • May 2018
    • April 2018
    • March 2018
    • February 2018
    • January 2018
    • December 2017
    • November 2017
    • October 2017
    • September 2017
    • August 2017
    • July 2017
    • June 2017
    • May 2017
    • April 2017
    • March 2017
    • February 2017
    • January 2017
    • December 2016
    • November 2016
    • October 2016
    • September 2016
    • August 2016
    • July 2016
    • June 2016
    • May 2016
    • April 2016
    • March 2016
    • February 2016
    • January 2016
    • December 2015
    • November 2015
    • October 2015
    • September 2015
    • August 2015
    • July 2015
    • June 2015
    • May 2015
    • April 2015
    • March 2015
    • February 2015
    • January 2015
    • December 2014
    • November 2014
© 2021 Business Intelligence Info
Power BI Training | G Com Solutions Limited