Data Management explained

Data is the information that drives business. It can be structured in rows and columns, such as a customer's name, address, and phone number, or it can be unstructured, such as an email or a social media post. Structured data is stored in relational database management systems (RDBMS), such as those from Oracle, IBM, and Microsoft, as well as the open-source PostgreSQL and MySQL, among others. That data can be accessed using the standard Structured Query Language (SQL). Unstructured data typically resides in what are called NoSQL databases, such as Cassandra, Couchbase, MongoDB, and many others. Many organizations today run both kinds of databases.
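To make the distinction concrete, here is a minimal sketch in Python using the standard library's sqlite3 module as a stand-in for a relational database. The table name, the sample customers, the sample emails, and the helper functions are all hypothetical and chosen only for illustration. It shows a SQL lookup against rows and columns, next to a plain keyword scan over free-form text that has no columns to query.

```python
import sqlite3

# Hypothetical sample data, invented for this illustration.
CUSTOMERS = [
    ("Ada Lopez", "12 Elm St", "555-0100"),
    ("Ben Cho", "9 Oak Ave", "555-0101"),
]
EMAILS = [
    "Hi, I would like a refund for my last order.",
    "Thanks for the quick delivery!",
]


def build_customer_db():
    """Create an in-memory relational table holding structured rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE customers (name TEXT, address TEXT, phone TEXT)")
    conn.executemany("INSERT INTO customers VALUES (?, ?, ?)", CUSTOMERS)
    conn.commit()
    return conn


def phone_for(conn, name):
    """Structured data: a SQL query selects a column by a known field."""
    row = conn.execute(
        "SELECT phone FROM customers WHERE name = ?", (name,)
    ).fetchone()
    return row[0] if row else None


def emails_mentioning(emails, keyword):
    """Unstructured data: with no schema, fall back to scanning the text."""
    return [text for text in emails if keyword.lower() in text.lower()]


if __name__ == "__main__":
    conn = build_customer_db()
    print(phone_for(conn, "Ada Lopez"))
    print(emails_mentioning(EMAILS, "refund"))
```

A production NoSQL or search system would index the text rather than scan it; the scan here is only meant to show that unstructured content lacks the named fields a SQL query relies on.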

Once data is stored, it must be easy to find amid the mountains of data organizations collect, easy to retrieve, and available at scale. Numerous tools exist for those jobs, including Hadoop, Apache Spark, and many more. It is through the collection and analysis of data that businesses can make decisions that affect their bottom line.

ScyllaDB 6.0 debuts with new replication architecture for greater elasticity

ScyllaDB has announced the release of version 6.0 of its database, which features a new replication architecture called “tablets.” The new architecture builds on the Raft consensus protocol and provides improvements to elasticity, speed, simplicity, and efficiency. ScyllaDB claims that clusters can now be doubled within 15 minutes, and their throughput in that same time … continue reading

Datadobi’s StorageMAP 7.0 release improves analytics capabilities for unstructured data

The data management company Datadobi has announced StorageMAP 7.0, the latest version of its unstructured data management platform.  “StorageMAP 7.0 is revolutionizing the way businesses and government organizations handle their unstructured data, transforming it into a powerhouse of strategic advantage,” said Carl D’Halluin, CTO of Datadobi.  StorageMAP 7.0 introduces two new analytics features: Custom Dashboards … continue reading

Unity Catalog – SD Times Open Source Project of the Week

Unity Catalog is an open source governance catalog for data and AI, developed by Databricks and open sourced earlier this week at the company’s Data + AI Summit.  Companies can use it to govern structured and unstructured data, and machine learning models, notebooks, dashboards, and files.  It offers interoperability with any data format and compute … continue reading

Cleansing email lists will help preserve your sender reputation score

Email is one of the most effective marketing channels out there. Compared to social media, where you are dependent upon the company’s algorithm treating your content favorably, email marketing can more effectively get your content in front of your audience. Email has an average open rate of 21.73% and an average click-through rate of 3.57%, … continue reading

Apache Pinot – SD Times Open Source Project of the Week

Apache Pinot is an open-source analytics platform that utilizes an OLAP database to provide low-latency insights into large amounts of data. OLAP stands for Online Analytical Processing and is a method in which data from multiple sources can be used together, allowing companies to group data from websites, applications, internal systems, and more together for … continue reading

Komprise launches new point-and-click solution for integrating AI services

Komprise, a company that provides management capabilities for unstructured data, has announced the release of a new service that will enable companies to more easily integrate their data with AI services.  According to Komprise, two of the biggest challenges to incorporating AI are discovering and feeding the right data and enriching data sets for AI, … continue reading

How data should inform modern platform development

Data is often seen as the output of applications — but it’s so much more than that. Like code itself, data is a fundamental building block of the development process. It has the ability to drive application logic, improve user experience, and increase business value. Therefore, taking a data-first approach to software design is critical. … continue reading

MongoDB Atlas gets a number of updates that enable new application use cases

At its annual developer conference, MongoDB.local NYC, MongoDB announced a number of new capabilities for its multi-cloud database MongoDB Atlas.  “Customers tell us they love MongoDB Atlas because it provides an integrated set of capabilities on one platform that can store and process their organization’s operational data across all of their applications,” said Sahir Azam, … continue reading

CData Sync Cloud brings CData’s ETL/ELT tool to the cloud

The data connectivity company CData has announced the release of CData Sync Cloud, a cloud-based ETL/ELT tool that brings the functionality of CData Sync to the cloud. The platform allows customers to replicate data from on-premises to on-premises, cloud to cloud, on-premises to cloud, and cloud to on-premises.  CData Sync Cloud provides integrations with over … continue reading

Redpanda launches serverless offering for its streaming data platform

Redpanda has announced a fully managed version of its streaming data platform, enabling developers to get started more quickly without having to worry about managing infrastructure as they scale up. Redpanda Serverless allows developers to easily spin up new clusters for streaming data, which they can then use to build applications. … continue reading

Google Safe Browsing now performs real-time checks in privacy-preserving manner

Google has announced a new way to further protect its privacy-minded users who are browsing using Google Safe Browsing, which is a Google Search setting that warns users when they may be entering a potentially dangerous site.  While it has warned users about harmful sites across 5 billion devices since its launch 15 years ago, … continue reading

SD Times Open-Source Project of the Week: OpenHouse

LinkedIn has announced it is open sourcing its control plane for managing tables in data lakehouse deployments. The tool, called OpenHouse, has been in use at LinkedIn for the past year. The company has 3,500 OpenHouse tables in production currently.  It was designed to offer self-service management of tables in open data lakehouses. According to … continue reading
