Root cause analysis metrics can improve software quality

Published: July 17th, 2013

More DevOps teams should be applying root cause analysis (RCA) to defects. The advantages are clear. RCA metrics on defects can be used to improve software quality by fixing the ineffective areas of the software development process, such as requirements, design, code verification, unit testing, test planning, and QA testing. The result is a drastic improvement in the overall quality of the software, and that means happy customers and lower development costs. Still not convinced? Let’s dig a little deeper.

What is RCA about?
The methodology of RCA grew out of a need to identify the underlying factors that contributed to a system failure or an adverse event of some kind. Using the analogy of a plane crash, the RCA would pull in the black box data along with the plane’s designers, mechanics and pilots with experience flying that particular model. The aim would be to find the cause or causes of the crash and make the necessary changes to prevent it from happening again.

While RCA has traditionally been employed in hardware engineering, it can also work very well in software engineering. For software developers, RCA is about pulling together the businesspeople, engineers, and QA department to figure out why a defect was introduced. This means going back to the original requirements; checking the design, code implementation, test plans and test execution cycles; and identifying the root cause of the defect. The process requires careful analysis and classification of the root cause if it is to work well, but the benefits far outweigh the time spent categorizing root causes and acting on them.

Defects cost the U.S. $60 billion
Examining the root causes of a defect can help to reveal breakdowns in communication between team members, weak links in processes that can be corrected, or training needs for individuals or groups. It has been more than a decade since the National Institute of Standards and Technology estimated that software defects were costing the U.S. economy US$59.5 billion annually. It found that up to 80% of development costs were being spent on identifying and fixing defects, and yet the end products were still shipping with unidentified defects.

If RCA metrics are employed efficiently, then defects can be traced to the source and processes can be tweaked or improved in order to eliminate them before they float downstream to QA (or worse, to the customer). By investigating and discovering the underlying causes, you can prevent those fires from starting in the first place, instead of focusing on extinguishing them.
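One way to see which parts of the process let defects float downstream is to compare the phase where each defect was introduced with the phase where it was found. The following is a minimal sketch under stated assumptions: the phase names, their order, and the sample records are all made up for illustration and do not come from any particular tool.

```python
from collections import Counter

# Hypothetical phase names, ordered from earliest to latest.
PHASES = ["requirements", "design", "code", "qa", "production"]

# Made-up sample data: (phase that introduced the defect, phase that found it).
defects = [
    ("requirements", "qa"),
    ("requirements", "production"),
    ("design", "design"),
    ("code", "code"),
    ("code", "qa"),
    ("code", "qa"),
]


def escaped_by_phase(records):
    """Count defects found in a later phase than the one that introduced them."""
    escaped = Counter()
    for introduced, found in records:
        if PHASES.index(found) > PHASES.index(introduced):
            escaped[introduced] += 1
    return escaped


escapes = escaped_by_phase(defects)
for phase in PHASES:
    print(f"{phase}: {escapes[phase]} escaped")
```

A phase with a high escape count is a candidate for a process fix, since its defects are being caught late rather than at the source.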

It is inevitable that you will encounter some resistance when you track RCA metrics. It is vital to establish that the process is not about playing the blame game. Focus on the idea that the process is at fault, rather than the individual, and everyone needs to pull together in order to improve the process.

Relatively detailed data is a prerequisite, and you should appoint a judge (usually the project manager) who can rule on the investigation and prevent things from descending into a shouting match. A good level of documentation is also important. The group works backward from the discovery of the defect, whether it was identified by the end customer, the QA tester, or perhaps even earlier by the developer. As it does so, it’s vital to document all defects and their root causes throughout all phases of the software development process (e.g., requirements review, design review, code review, testing).

It could be that the original requirements were not clear, or maybe the developer applied their own interpretation, or perhaps the QA team missed testing for that potential scenario. Whatever the case, when the cause has been identified, move the focus swiftly to how it could be prevented. The aim is to document an improvement to the process that ensures a similar defect doesn’t arise in the future. For example, a possible fix may be to implement a checklist for the requirements document, or stage a more formal review of the design that includes business analyst (BA) and QA members. Perhaps you need to formally check during implementation that the developers have conducted unit testing or peer code reviews. It may be necessary for QA to write more detailed test cases, or to take part in the code-verification process.
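The outcome of an RCA session can be captured as a small structured record, so that each defect carries its root cause and the agreed process fix. This is only a sketch: the `RcaRecord` and `RootCause` names, the field names, the category list, and the defect ID are hypothetical, chosen to mirror the examples in the paragraph above.

```python
from dataclasses import dataclass
from enum import Enum


class RootCause(Enum):
    # Illustrative categories only; a real team would define its own list.
    UNCLEAR_REQUIREMENT = "unclear requirement"
    DEVELOPER_INTERPRETATION = "developer interpretation"
    MISSED_TEST_SCENARIO = "missed test scenario"


@dataclass
class RcaRecord:
    defect_id: str         # hypothetical identifier format
    found_by: str          # e.g. "customer", "QA tester", "developer"
    root_cause: RootCause  # the single category agreed in the RCA session
    process_fix: str       # the documented improvement to the process

    def summary(self) -> str:
        """One-line description linking the cause to the preventive action."""
        return f"{self.defect_id}: {self.root_cause.value} -> {self.process_fix}"


record = RcaRecord(
    defect_id="DEF-101",
    found_by="QA tester",
    root_cause=RootCause.UNCLEAR_REQUIREMENT,
    process_fix="add a checklist for the requirements document",
)
print(record.summary())
```

Restricting the root cause to a fixed set of categories keeps the classification consistent, which matters once the records are counted and compared.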

Look beyond the post mortem
Many companies will only engage in RCA for production defects that make it through to customers, or for very serious defects discovered in production, but the process can be usefully applied throughout all phases of software development. The greater your volume of data, the better your chance of identifying patterns that signal a fundamental problem with a process that can potentially be solved.

Several months of data can give you insights that you’ll never get from only a few production-critical defects. It can also be used to tighten processes and improve efficiency for every project going forward, not just the software you happen to be working on right now.
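With enough classified defects, a simple tally can show which root causes dominate. The sketch below ranks causes by frequency and adds a cumulative share, in the style of a Pareto analysis. The category names and counts are invented sample data, not figures from any real project.

```python
from collections import Counter

# Made-up sample: one root-cause label per defect logged over several months.
root_causes = (
    ["unclear requirement"] * 5
    + ["missed test scenario"] * 3
    + ["no unit test"] * 2
)


def pareto(labels):
    """Return (cause, count, cumulative share) rows, most frequent cause first."""
    counts = Counter(labels)
    total = sum(counts.values())
    rows = []
    running = 0
    for cause, count in counts.most_common():
        running += count
        rows.append((cause, count, running / total))
    return rows


rows = pareto(root_causes)
for cause, count, cumulative in rows:
    print(f"{cause}: {count} defects, {cumulative:.0%} cumulative")
```

If one or two categories account for most of the defects, those are the processes to tighten first.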

It is the QA department’s job to champion this process. They should lead the investigation and force the necessary changes, backed up by the certainty of statistical analysis. This can lead to serious improvements to major processes, can help prevent software from being released prematurely, and can enable more-effective use of all resources. These actions also work perfectly with the QA department’s ultimate aim, which is to identify all defects by any means necessary and ensure that the customer doesn’t find them.

Lack of time is a false economy
The adoption of RCA methodology has been widespread, and its ability to save lives when applied to the healthcare industry or accident investigation is well documented, but there’s no reason it should be confined to these fields. The pressure to deliver high-quality software quickly and roll out new features as soon as possible is a real problem for the software development industry. The old adage about more haste, less speed holds true.

Kaushal Amin is CTO of KMS Technology, a software development and IT services firm based in Atlanta and Ho Chi Minh City, Vietnam.


