Analytics

Analytics | News, how-tos, features, reviews, and videos

Do more with R: RStudio addins and keyboard shortcuts

Speed up your R programming workflow with RStudio addins and customized keyboard shortcuts

Optimize Apache Kafka by understanding consumer groups

Getting the most out of any Apache Kafka event streaming deployment requires a thorough understanding of Kafka consumer groups. Here’s what you need to know.

How to get started with event-driven microservices

Event-driven microservices are an excellent way to deliver both historical and new data to all of the systems and teams that need it, but they come with additional overhead and management requirements. Start small.

Making the most of geospatial intelligence

How the HEAVY.AI platform accelerates geospatial intelligence and delivers advanced analytics and real-time data visualizations that help telcos, utilities, and government agencies improve operations and minimize risk.

How InfluxDB revved up for real-time analytics

A new Rust-based database engine, InfluxDB IOx, brings an in-memory columnar store, unlimited cardinality, and SQL language support to the open source time series database, raising the bar for advanced analytics across time series...

Migrating Mastodon lists

If you move from one Mastodon server to another and you want to migrate the people on your lists, you have few options. Here’s a way to do it with Steampipe and SQL.

Preview: Google Cloud Dataplex wows

Google Cloud Dataplex is an amazingly complete system for turning raw data from silos into unified data products ready for analysis. And a bit overwhelming to learn.

Modern data infrastructures don’t do ETL

Business happens in real time but many business systems don’t. It’s time to move past client-server databases, data warehouses, and batch processes.

The Mastodon plugin is now available on the Steampipe Hub

The fediverse offers an opportunity to reboot the social web and gain control of our information diets. Steampipe and its Mastodon plugin can help you seize it.

10 best practices for every MongoDB deployment

From security musts and indexing gotchas to replication and sharding tips, follow these essential dos and don’ts to make the most of your MongoDB database systems.

Databricks launches lakehouse for manufacturing sector

The new industry-specific lakehouse could help Databricks increase lakehouse adoption, according to analysts.

What is Apache Spark? The big data platform that crushed Hadoop

Fast, flexible, and developer-friendly, Apache Spark is the leading platform for large-scale SQL, batch processing, stream processing, and machine learning.

MariaDB SkySQL adds serverless analytics, cost management features

The new release of the managed database as a service removes the need for extracting, transforming, and loading data by adding a ‘serverless’ layer powered by Apache Spark SQL.

6 ways to avoid and reduce data debt

Data debt can be just as bad as tech debt, causing security and trust problems if it isn’t addressed throughout the data pipeline.

How to babysit your AI

AI systems are not yet mature and capable enough to operate independently, but they can still work wonders with human help. We just need a few guardrails.

Mastodon timelines for teams

Using Steampipe and SQL to pool Mastodon timelines and point queries and dashboards at the combined histories of teams or groups.

Tibco’s Spotfire 12.2 release adds streaming and data science tools

The latest iteration of Spotfire makes it an end-to-end data visualization and analytics platform combining data science, streaming, and data management tools.

How to explain the machine learning life cycle to business execs

For data science teams to succeed, business leaders need to understand the importance of MLops, modelops, and the machine learning life cycle. Try these analogies and examples to cut through the jargon.

Can AI solve IT’s eternal data problem?

New data management and integration solutions featuring AI and machine learning signal that help is on the way to meet the ballooning enterprise data challenge.

Visualizing Mastodon server moderation

Which Mastodon servers are blocking other Mastodon servers, and which servers are being blocked? We can discover them using Steampipe’s relationship graphs.
