Topic: sre

SD Times news digest: Gremlin Automatic Service Discovery, WhiteHat Attack Surface Management, and Jamf’s same-day Apple OS support

Gremlin has added Automatic Service Discovery to its chaos engineering platform in an effort to help companies improve resilience and reduce downtime by identifying the various services running across distributed systems.  “The rise in popularity of microservices necessitate services functioning as first-class citizens. The infrastructure layer is becoming more abstract and engineers are increasingly thinking … continue reading

SD Times news digest: Qt acquires froglogic, the Embedded Software Testing & Compliance Summit, and Catchpoint’s virtual SRE community event

Qt announced that it will acquire froglogic GmbH, a major provider of quality assurance tools, to bring froglogic’s test automation tools into the Qt product portfolio. “As The Qt Company continues its growth, the acquisition of froglogic is an important milestone in  broadening Qt’s best-in-class software development tools and building in automated testing and code … continue reading

Report finds chaos engineering can significantly decrease MTTR and increase availability

A new report revealed those who have successfully implemented chaos engineering have 99.9% or higher availability and greatly improved their mean time to resolution (MTTR).  Gremlin’s inaugural 2021 State of Chaos Engineering report found 23% of teams who frequently run chaos engineering projects had a MTTR of under 1 hour, and 60% under 12 hours. … continue reading

GitLab collaborates with Google Cloud on app deployment

GitLab announced a new collaboration with Google Cloud to offer native integration into Google Kubernetes Engine (GKE). This new integration aligns with GitLab’s vision of Auto DevOps. Auto DevOps is GitLab’s way of automating DevOps and delivering ideas to production faster. It consists of a collection of build, test and deployment features. The new integration … continue reading

LinkedIn open sources Kafka Cruise Control

Although Apache Kafka is widely adopted, there are still operational challenges that teams run into when they try to run Kafka at scale. In order to restore balance to Kafka clusters, LinkedIn open sourced and developed Cruise Control, its general-purpose system that continuously monitors clusters and automatically adjusts the resources needed to meet pre-defined performance … continue reading

Site Reliability Engineer: Don’t fall victim to the bias blind spot

To ensure websites and applications deliver consistently excellent speed and availability, some organizations are adopting Google’s Site Reliability Engineering (SRE) model. In this model, a Site Reliability Engineer (SRE) – usually someone with both development and IT Ops experience – institutes clear-cut metrics to determine when a website or application is production-ready from a user … continue reading

Industry Watch: Site Reliability Engineers make DevOps work

Business today is software. Every company, whether it realizes it or not, is a software company. For online retailers, banks, investment firms, news organizations, insurance companies—just about every business, in fact—applications are their outward face to their clients. But applications today are complex. No longer monolithic in nature, apps today involve API calls to various … continue reading

Ad will close in seconds
Continue to site
HTML Snippets Powered By : XYZScripts.com

Get access to this and other exclusive articles for FREE!

There's no charge and it only takes a few seconds.

Sign up now!