Topic: data

SD Times Open-Source Project of the Week: Apache Iceberg

Apache Iceberg is an open table format for datasets that can be used with compute engines like Spark, Trino, PrestoDB, Flink, and Hive. It has a lot of failsafes in place to ensure that users don’t accidentally mess up a table with a wrong command. Its schema evolution supports tasks like add, drop, update, or … continue reading

India’s data protection plan would affect how data is managed there

Several countries have successfully implemented major data privacy and protection regulations over the past decade. The EU’s General Data Protection Regulation (GDPR) drastically changed how companies managed data, not just for their customers in the EU, but worldwide. Then came the California Consumer Privacy Act (CCPA), which had a similar cascading effect when companies decided … continue reading

Data democratization and integration needed to take advantage of massive data explosion

Data is the key to success for all parts of a business these days. Data analysis is no longer limited to a particular data-focused team, but is done by anyone looking to gain insight into how well their team is performing, to determine business value, efficiencies, and inefficiencies, and much more. But in order to … continue reading

Accurate data isn’t enough — it needs to be actionable

As the COVID-19 pandemic has shut down offices, people have taken to working remotely, and many have even moved from locations where the cost of living is high to places where it’s more manageable. This raises the issue of data quality, not only in terms of the accuracy of data being input into fields in … continue reading

SD Times Open-Source Project of the Week: BumbleBee

BumbleBee simplifies building extended Berkeley Packet Filter (eBPF) tools and allows users to package, distribute, and run them anywhere. eBPF makes the Linux kernel extensible, letting developers program it to quickly build intelligent or feature-rich functions based on their business needs. BumbleBee brings a Docker-like experience for eBPF, and through simple … continue reading

GPT-3 can now be customized to individual applications

Developers can now fine-tune GPT-3 on their own data, creating a custom version tailored to their application, which allows for faster and cheaper running of models. GPT-3 is a natural language processing tool developed by AI research laboratory OpenAI. Users have to run a single command in the OpenAI CLI tool with the file that … continue reading

SD Times Open-Source Project of the Week: immudb

Immudb is a database written in Go that is immutable, which means that history is preserved and can’t be changed without clients noticing. “Traditional database transactions and logs are hard to scale and are mutable, so there is no way to know for sure if your data has been compromised,” the project’s website states. “Immudb … continue reading

Talend Fall 2021 release introduces data health concepts

Talend has announced the release of Talend Fall 2021, which adds data health concepts across Talend Data Fabric. The new version includes Stitch Unlimited, which offers industry-first, non-consumption-based pricing for unlimited users and integrations. Users will also have access to a Trust Score so that everyone can know that they’re making the right decisions based … continue reading

Progress released new troubleshooting solution, Fiddler Jam

Progress today announced the general availability of Progress Telerik Fiddler Jam, a troubleshooting solution that lets support and development teams address customer issues remotely. With this release, new features have become available, such as the option for video recording, capturing events during a session recording, and masking sensitive data. Progress … continue reading

Data Quality: Volume, interdependencies can create big problems

The growing mountains of data generated by organizations are staggering, so ensuring that data is of good quality is a huge challenge. As the SD Times Data Quality Project 2021 has revealed, one area in particular that can give companies fits is product information. In the case of the pharmaceutical industry, but likely true … continue reading

Why OpenTelemetry is driving a new wave of innovation on top of observability data

The last decade has brought a progressive transition from monolithic applications that run on static infrastructure to microservices that run on highly dynamic cloud-native infrastructure. This shift has led to the rapid emergence of lots of new technologies, frameworks, and architectures and a new set of monitoring and observability tools that give engineers full visibility … continue reading

Report: Companies prioritize securing open-source components in modern software

The rapid adoption of the cloud has led companies to increasingly secure open-source components in modern software. The newly released 12th Building Security In Maturity Model (BSIMM12) report found a 61% increase in software security groups’ identification and management of open source over the past two years. The report was created by Synopsys, a company … continue reading
