Five tips for big integration projects

Column

Published: April 15th, 2014

Software development isn’t easy. And the bigger the software gets, the harder it is to build right from the ground up. A string of high-profile failures has given us a timely reminder of this. But let’s look at the HealthCare.gov fiasco in a little more detail. Yes, there were some user interface idiosyncrasies. Yes, the site wasn’t up to handling the traffic it received. Yes, users were turned away, certain plans or providers were not included, and many of those precious few applications that were submitted had unexplained errors. But if we look past the symptoms, underlying much of this seemed to be serious problems with the system integration.

This entire site was essentially a friendly face on a massive integration project, and it’s certainly not alone in that, in the healthcare space or others. But massive integration projects can be done well, even at the scale we’re talking about. Here are five things to consider to bump the odds in your favor:

1. Separate integration logic from application logic
A user-facing application shouldn’t be responsible for translating data formats, converting between synchronous and asynchronous requests, or retrying when a partner system is down. Not to mention that there shouldn’t be any need to redeploy an application if a partner endpoint changes. Application frameworks and application developers are better at building applications, while integration frameworks and tools are better at handling integration challenges. It’s best to build an application to send and receive idealized formats, and to use external integration tools to handle detailed transformations and other processing.

Does this mean an ESB? Spring Integration or Apache Camel? An enterprise integration product? Just a pile of custom code? It’s hard to give a specific answer. We’ve seen solutions based on all of the above, from Groovy scripts to off-the-shelf products. The concept is the important part: building a service that can easily adapt to changing integration partners, formats or endpoints, and leaving the application side of the interface fairly clean.

2. Use tools that make integration easy
When extensive integration is a given, it makes the most sense to select tools that make integration easy. This means easy to configure, easy to develop transformations and other logic, easy to test, easy to debug, and easy to deploy. It means supporting multiple options for building integration logic—perhaps avoiding ugly generated code in favor of dynamic inspection, perhaps avoiding complex GUIs in favor of simple blocks of code. It means making it easy to reject or queue requests when an endpoint is down, easy to update individual integration flows, and easy to scale out to handle additional load.

The problem is that most options look the same from a spec sheet. The only way to tell is with some hands-on experience. How easy is it for a developer get some integration code and corresponding tests up and running on his or her machine? Can the configuration or code be sensibly version-controlled, and flow through already established channels for continuous integration and continuous deployment? How easy is it to deploy versioned integration logic, supporting multiple releases of a client or endpoint? Can it be secured conveniently if needed? Most likely a proof of concept will be the only way to answer these questions.

3. Parallel implementations can work
Projects with many integration points tend to have a large number of interested parties, many of whom may be developing their parts of the system in parallel. Typically each group finishes and tests their components, then hooks them up to the rest for testing. When you put all these first drafts together, all late in the game, you typically run into a lot of late-breaking problems. But there’s an easy way to avoid that. If everybody starts by providing simulated requests and replies, then testing can begin on day one and each group can phase in real implementations over time. Each system gets immediate feedback as to whether anything’s breaking, and there’s much less debugging required to fix small changes from a known working state.

The easiest way to start is by capturing a request or reply message (if there’s a working channel available, either production or test). Failing that, perhaps to construct a simple message by hand. Every time the integration channel is invoked, return the same static message. Perhaps broaden out from there to a handful of messages, if there are different types of requests or replies. Then slowly start phasing in actual logic. That may mean substituting parts of the static messages with real content piece by piece, or using code to process the requests and replies but feeding it initially with largely static data. Over time, add the needed logic behind the scenes, populate the rest of the common requests or replies, and then work out toward the edge cases. If both sides can agree on an order of attack, so much the better.

It still won’t help if one side takes the firewall approach, not releasing an iota of code until it’s “done.” But even if things look to be heading that way, telling your partners that your side of the pipe is in place and ready for testing on day one may encourage them to give it a go, and any feedback you get will help.

4. Build in monitoring from the start
Let’s face it: problems will happen. Partner systems will be down, data formats will change, delays will accumulate. Besides simply firing alerts when problems crop up, integrated monitoring can help reproduce and troubleshoot problems. Bad requests can be stashed for later use, or a long series of requests can be captured for playback in a load test. Timing requests and gathering statistics across request types can suggest when to separate certain flows onto different machines for performance. But all that aside, simply monitoring for failures in any of the integration points can mean the difference between proactive fixes and customers complaining that their requests were never completed.

It’s all well and good to build a system that has monitoring support, but take the opportunity early to use it. Build automated tests with messages captured via the monitoring channels. When integrating new partners, instead of manually inspecting early test responses, let the monitoring system inform you when a problem crops up. It may seem like more work up front, but having all the plumbing in place sure beats “monitoring support” that’s never been tried in practice.

5. Expect change
If there’s one certainty in the world of software, it’s change. Systems will be upgraded, formats will be revised, features will be added. The more systems are interconnected, the more an isolated change is likely to have unanticipated repercussions.

In some ways, all the prior points have been leading up to this. When one endpoint changes, it’s better if the integration logic is isolated from the related application logic, so accommodating the change doesn’t automatically require changes to additional applications. It’s better if the related integration logic is easy to update, test and deploy. It’s better if altered requests or responses are immediately flagged, and can be captured and replayed in testing.

And one step further: It’s best if all the systems have a comprehensive automated test suite. When these changes happen, it’s much easier to test a fix if the computer can run the entire scenario for you. Pop in the latest messages from the error queue and instantly reproduce the problem. That probably depends, of course, on having an adequate test environment, with real, cleansed or adequately simulated production data, and a process to keep it up to date. It’s worth the investment, though: If change is a given, you’ll be leaning heavily on the test environment. With today’s tools, there’s no reason it can’t be straightforward to replicate the production stack where needed, whether on your own machine or on a temporary node in the cloud.

So the next time you kick off a big integration project, remember: It’s possible to do it right. When 10% of the applications submitted via HeathCare.gov have unrecoverable errors, we should all be embarrassed.

Aaron Mulder is CTO and director at Chariot Solutions, responsible for the design and deployment of technical development standards, policies and style guidelines. He has been working with Java technology since its inception. He has directly contributed to many open-source projects, including Apache projects such as Geronimo, ActiveMQ, ServiceMix, OpenEJB, and XBean, as well as other projects, including JBoss and PostgreSQL.

Guest Views are contributions by SD Times readers. Interested in contributing a Guest View? See the guidelines for the details.

About aaron.mulder

View all posts by aaron.mulder

Cookie	Duration	Description
cf_use_ob	past	Cloudflare sets this cookie to improve page load times and to disallow any security restrictions based on the visitor's IP address.
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
JSESSIONID	session	The JSESSIONID cookie is used by New Relic to store a session identifier so that New Relic can monitor session counts for an application.
PHPSESSID	session	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
__atuvc	1 year 1 month	AddThis sets this cookie to ensure that the updated count is seen when one shares a page and returns to it, before the share count cache is updated.
__atuvs	30 minutes	AddThis sets this cookie to ensure that the updated count is seen when one shares a page and returns to it, before the share count cache is updated.
__cf_bm	30 minutes	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.

Cookie	Duration	Description
__gads	1 year 24 days	The __gads cookie, set by Google, is stored under DoubleClick domain and tracks the number of times users see an advert, measures the success of the campaign and calculates its revenue. This cookie can only be read from the domain they are set on and will not track any data while browsing through other sites.
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga_S6PB8V57DG	2 years	This cookie is installed by Google Analytics.
_gat_gtag_UA_846073_1	1 minute	Set by Google to distinguish users.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
_jsuid	1 year	This cookie contains random number which is generated when a visitor visits the website for the first time. This cookie is used to identify the new visitors to the website.
at-rand	never	AddThis sets this cookie to track page visits, sources of traffic and share counts.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
iutk	5 months 27 days	This cookie is used by Issuu analytic system to gather information regarding visitor activity on Issuu products.
uvc	1 year 1 month	Set by addthis.com to determine the usage of addthis.com service.
vuid	2 years	Vimeo installs this cookie to collect tracking information by setting a unique ID to embed videos to the website.
WMF-Last-Access	1 month 14 hours 26 minutes	This cookie is used to calculate unique devices accessing the website.

Cookie	Duration	Description
__Host-GAPS	2 years	This cookie allows the website to identify a user and provide enhanced functionality and personalisation.
_pxhd	session	Used by Zoominfo to enhance customer data.
IDE	1 year 24 days	Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
loc	1 year 1 month	AddThis sets this geolocation cookie to help understand the location of users who share the information.
mc	1 year 1 month	Quantserve sets the mc cookie to anonymously track user behaviour on the website.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
__gpi	1 year 24 days	No description
__Secure-YEC	1 year 1 month	No description
_heatmaps_g2g_100754890	10 minutes	No description
_techvalidate_session	session	No description
cf_7166_id	20 years	No description
cf_7166_person_last_update	session	No description
f5avraaaaaaaaaaaaaaaa_session_	session	No description available.
GoogleAdServingTest	session	No description
Gyazo_cfwoker	7 years 2 months 17 days 7 hours	No description
incap_ses_451_2783402	session	No description
incap_ses_769_2783402	session	No description
loglevel	never	No description available.
m	2 years	No description available.
nlbi_2783402	session	No description
prism_252377639	1 month	No description
TS011605d9	session	No description
ustream-guest	session	No description available.
visid_incap_2783402	1 year	No description
xtc	1 year 1 month	No description

AI

AI and Software Development

Observability

Guide to Observability

CI/CD

A guide to CI/CD

Cloud Native

Cloud Native Content

Data

A Guide to Data

Test

Security Testing

Mobile

Mobile Testing

API

Sponsored by Parasoft

Performance

Load & Performance Testing

DevSecOps

A Guide to DevSecOps

Enterprise Security

A Guide to Security

Supply Chain Security

Supply Chain Security

Dev Manager

Dev Managers Content

Agile

A Guide To Agile

Value Stream

A Guide To Value Stream

Productivity

A Guide To Productivity

DevOps

DevOps Content

API

Gravitee.io

AI

AI and Software Development

Value Stream Management

A Guide To Value Stream

Five tips for big integration projects

Subscribe to SDTimes

About aaron.mulder

Related Articles

Industry Watch: Security first and foremost

Industry Watch: Internet crime complaints rise

Guest View: OLAP + OLTP = …PostgreSQL?

Potentially huge new markets for developers