Analytics
Do more with R: RStudio addins and keyboard shortcuts
Speed up your R programming workflow with RStudio addins and customized keyboard shortcuts
Optimize Apache Kafka by understanding consumer groups
Getting the most out of any Apache Kafka event streaming deployment requires a thorough understanding of Kafka consumer groups. Here’s what you need to know.
How to get started with event-driven microservices
Event-driven microservices are an excellent way to deliver both historical and new data to all of the systems and teams that need it, but they come with additional overhead and management requirements. Start small.
Making the most of geospatial intelligence
How the HEAVY.AI platform accelerates geospatial intelligence and delivers advanced analytics and real-time data visualizations that help telcos, utilities, and government agencies improve operations and minimize risk.
How InfluxDB revved up for real-time analytics
A new Rust-based database engine, InfluxDB IOx, brings an in-memory columnar store, unlimited cardinality, and SQL language support to the open source time series database, raising the bar for advanced analytics across time series...
Migrating Mastodon lists
If you move from one Mastodon server to another and you want to migrate the people on your lists, you have few options. Here’s a way to do it with Steampipe and SQL.
Preview: Google Cloud Dataplex wows
Google Cloud Dataplex is an amazingly complete system for turning raw data from silos into unified data products ready for analysis. It is also a bit overwhelming to learn.
Modern data infrastructures don’t do ETL
Business happens in real time, but many business systems don’t. It’s time to move past client-server databases, data warehouses, and batch processes.
The Mastodon plugin is now available on the Steampipe Hub
The fediverse offers an opportunity to reboot the social web and gain control of our information diets. Steampipe and its Mastodon plugin can help you seize it.
10 best practices for every MongoDB deployment
From security musts and indexing gotchas to replication and sharding tips, follow these essential dos and don’ts to make the most of your MongoDB database systems.
Databricks launches lakehouse for manufacturing sector
The new industry-specific lakehouse could help Databricks increase lakehouse adoption, according to analysts.
What is Apache Spark? The big data platform that crushed Hadoop
Fast, flexible, and developer-friendly, Apache Spark is the leading platform for large-scale SQL, batch processing, stream processing, and machine learning.
MariaDB SkySQL adds serverless analytics, cost management features
The new release of the managed database as a service removes the need for extracting, transforming, and loading data by adding a ‘serverless’ layer powered by Apache Spark SQL.
6 ways to avoid and reduce data debt
Data debt can be just as bad as tech debt, causing security and trust problems if it isn’t addressed throughout the data pipeline.
How to babysit your AI
AI systems are not yet mature and capable enough to operate independently, but they can still work wonders with human help. We just need a few guardrails.
Mastodon timelines for teams
Using Steampipe and SQL to pool Mastodon timelines and point queries and dashboards at the combined histories of teams or groups.
Tibco’s Spotfire 12.2 release adds streaming and data science tools
The latest iteration of Spotfire makes it an end-to-end data visualization and analytics platform combining data science, streaming, and data management tools.
How to explain the machine learning life cycle to business execs
For data science teams to succeed, business leaders need to understand the importance of MLops, modelops, and the machine learning life cycle. Try these analogies and examples to cut through the jargon.
Can AI solve IT’s eternal data problem?
New data management and integration solutions featuring AI and machine learning signal that help is on the way to meet the ballooning enterprise data challenge.
Visualizing Mastodon server moderation
Which Mastodon servers are blocking other Mastodon servers, and which servers are being blocked? We can discover them using Steampipe’s relationship graphs.