MIT CSAIL brings reasoning to machine learning

Published: November 1st, 2016

Machine learning is a hot topic in the software industry right now. More and more companies are taking advantage of this artificial intelligence technique to train machines on their data and make predictions. Researchers from MIT’s Computer Science and Artificial Laboratory (CSAIL) want to take it a step further by revealing how a machine makes those insights.

If you are using a machine learning system in the medical field to help diagnose the illness of a patient, you want to make sure that the system is correct. Similarly, if you are a business that uses machine learning to make investment decisions, you want to make sure you can justify the cost.

“One big thing with most artificial systems is, essentially, trust,” said Tommi Jaakkola, an MIT professor of electrical engineering and computer science. “If you are communicating with an AI system, you want to trust that the business decisions it is making [are] actually reasonable. One way to do that is to force the method to communicate the basis by which it made a particular decision. Any context where you have to take an action that has consequences, you would want to justify the predictions.”

To accomplish that, the researchers looked at a branch of machine learning: deep learning. Deep learning uses neural networks to learn from data and make intelligent decisions. Neural networks are a computational approach to mimicking the human brain.

To obtain the rationale behind the neural networks, the researchers divided them into in two modules: generator and encoder. The role of the generator is to extract, while the role of the encoder is to predict. The researchers then trained the two components to operate together.

“The encoder is essentially just a deep learning approach,” said Jaakkola. “It takes text and makes a prediction. You can run it in terms of the original text and learn the mapping, but it wouldn’t reveal anything about the justification for the prediction. Now we couple that with the generator that makes the selection. It selects the rationale and tests it through the encoder to see if it alone would suffice to make that prediction.”

The researchers tested this technique on beer reviews. According to Jaakkola, the researchers chose beer reviews because they provide a predictable aspect (the score people gave based on the beer). “The goal is not to do justification for beer reviews, but because they have the data already annotated, it served as a great test bed for seeing whether the method would work,” he said.

The experiment included a five-star rating system based on aroma, palate and appearance. The results showed 95% to 96% agreement with human annotation based on appearance and aroma, and 80% based on palate.

In addition, the researchers applied the technique to pathology reports to examine breast biopsies and diagnoses. According to Tao Lei, an MIT graduate student in electrical engineering and computer science, they were able to predict the results with 97% to 98% accuracy.

Jaakkola explained that this technique could already be broadly applied to many systems by reformatting them. “Everything happens totally unsupervised, meaning that we don’t need any annotation of what the rationales are. We are just decoupling the process of prediction in order to force the method to come up with the rationale,” he said.

Going forward, the researchers will look into how to make the technique more sophisticated. For instance, a broader direction of the research is looking into how to communicate with complex learning systems. “You might have an idea about the rationale that is suitable for this particular prediction. How would you communicate that with a system? It opens up the possibility of exploring bidirectional communication on human terms with complex machine learning predictors,” Jaakkola said.

Specifically, the researchers will look into expressing types of rationales from one context to another. “What is a reasonable rationale, say, for a medical context might not be the same as a reasonable rationale in investment decisions,” said Jaakkola. “There are structure and constraints in each domain. We can explore more sophisticated ways of articulating those rationales and incorporating them in the overall learning algorithm.”

More information is available in the paper “Rationalizing Neural Predictions” by Regina Barzilay, Jaakkola and Lei.

Article Tags

AI, artificial intelligence, deep learning, machine learning, MIT Computer Science and Artificial Laboratory, MIT CSAIL, neural nets, neural networks

About Christina Cardoza

Christina Cardoza is the News Editor of SD Times. She is responsible for the oversight of the daily news published to the website as well as the company's weekly newsletter, News on Monday. She covers agile, DevOps, AI, machine learning, mixed reality and software security. She is an undeniable nerd who loves Marvel comics and Star Wars. On Follow her on Twitter at @chriscatdoza!

View all posts by Christina Cardoza

Cookie	Duration	Description
cf_use_ob	past	Cloudflare sets this cookie to improve page load times and to disallow any security restrictions based on the visitor's IP address.
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
JSESSIONID	session	The JSESSIONID cookie is used by New Relic to store a session identifier so that New Relic can monitor session counts for an application.
PHPSESSID	session	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
__atuvc	1 year 1 month	AddThis sets this cookie to ensure that the updated count is seen when one shares a page and returns to it, before the share count cache is updated.
__atuvs	30 minutes	AddThis sets this cookie to ensure that the updated count is seen when one shares a page and returns to it, before the share count cache is updated.
__cf_bm	30 minutes	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.

Cookie	Duration	Description
__gads	1 year 24 days	The __gads cookie, set by Google, is stored under DoubleClick domain and tracks the number of times users see an advert, measures the success of the campaign and calculates its revenue. This cookie can only be read from the domain they are set on and will not track any data while browsing through other sites.
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga_S6PB8V57DG	2 years	This cookie is installed by Google Analytics.
_gat_gtag_UA_846073_1	1 minute	Set by Google to distinguish users.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
_jsuid	1 year	This cookie contains random number which is generated when a visitor visits the website for the first time. This cookie is used to identify the new visitors to the website.
at-rand	never	AddThis sets this cookie to track page visits, sources of traffic and share counts.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
iutk	5 months 27 days	This cookie is used by Issuu analytic system to gather information regarding visitor activity on Issuu products.
uvc	1 year 1 month	Set by addthis.com to determine the usage of addthis.com service.
vuid	2 years	Vimeo installs this cookie to collect tracking information by setting a unique ID to embed videos to the website.
WMF-Last-Access	1 month 14 hours 26 minutes	This cookie is used to calculate unique devices accessing the website.

Cookie	Duration	Description
__Host-GAPS	2 years	This cookie allows the website to identify a user and provide enhanced functionality and personalisation.
_pxhd	session	Used by Zoominfo to enhance customer data.
IDE	1 year 24 days	Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
loc	1 year 1 month	AddThis sets this geolocation cookie to help understand the location of users who share the information.
mc	1 year 1 month	Quantserve sets the mc cookie to anonymously track user behaviour on the website.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
__gpi	1 year 24 days	No description
__Secure-YEC	1 year 1 month	No description
_heatmaps_g2g_100754890	10 minutes	No description
_techvalidate_session	session	No description
cf_7166_id	20 years	No description
cf_7166_person_last_update	session	No description
f5avraaaaaaaaaaaaaaaa_session_	session	No description available.
GoogleAdServingTest	session	No description
Gyazo_cfwoker	7 years 2 months 17 days 7 hours	No description
incap_ses_451_2783402	session	No description
incap_ses_769_2783402	session	No description
loglevel	never	No description available.
m	2 years	No description available.
nlbi_2783402	session	No description
prism_252377639	1 month	No description
TS011605d9	session	No description
ustream-guest	session	No description available.
visid_incap_2783402	1 year	No description
xtc	1 year 1 month	No description

AI

AI and Software Development

Observability

Guide to Observability

CI/CD

A guide to CI/CD

Cloud Native

Cloud Native Content

Data

A Guide to Data

Test

Security Testing

Mobile

Mobile Testing

API

Sponsored by Parasoft

Performance

Load & Performance Testing

DevSecOps

A Guide to DevSecOps

Enterprise Security

A Guide to Security

Supply Chain Security

Supply Chain Security

Dev Manager

Dev Managers Content

Agile

A Guide To Agile

Value Stream

A Guide To Value Stream

Productivity

A Guide To Productivity

DevOps

DevOps Content

API

Gravitee.io

AI

AI and Software Development

Value Stream Management

A Guide To Value Stream

MIT CSAIL brings reasoning to machine learning

Article Tags

Subscribe to SDTimes

About Christina Cardoza

Related Articles

The AI productivity paradox in software engineering: Balancing efficiency and human skill retention

Gartner: More than 40% of agentic AI projects will be canceled in the next few years

June 2025: All AI updates from the past month

This week in AI dev tools: A2A donated to Linux Foundation, OpenAI adds Deep Research to API, and more (June 27, 2025)