Get the essential data observability guide
Download this guide to learn:
What is data observability?
4 pillars of data observability
How to evaluate platforms
Common mistakes to avoid
The ROI of data observability
Unlock now
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Sign up for a free data observability workshop today.
Assess your company's data health and learn how to start monitoring your entire data stack.
Book free workshop
Sign up for news, updates, and events
Subscribe for free
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Getting started with Data Observability Guide

Make a plan to implement data observability across your company’s entire data stack

Download for free
Book a data observability workshop with an expert.

Assess your company's data health and learn how to start monitoring your entire data stack.

Book free workshop

New feature: Data insights

Metaplane’s latest feature makes it easy to drill down into queries to highlight opportunity gaps for performance and cost optimization.

March 27, 2024

Product, Design

Frontend Software Engineer

March 27, 2024
New feature: Data insights

Mature data organizations can have thousands to tens of thousands of queries running per day. In addition to one-off queries used for testing and addressing ad-hoc questions, organizations frequently have huge numbers of recurring queries. These queries come in the form of data cleaning, aggregations, and other transformations required to progress data from its initial ingestion state to a format that’s more useful for the business.

Over time, however, you may start to notice that your queries are starting to run more slowly, causing ripple effects due to latency dependencies. Or perhaps your organization has grown more conscious about how it’s spending its money, and has asked the data team to do some warehouse cost cutting. Both of these needs can result in data teams scrambling to understand where and how to get the most bang for their optimization buck.

Data Insights Dashboard

Metaplane’s latest feature, Data Insights, makes it easy to understand where your team can make the highest impact when it comes time to optimize your query performance and cost. We do this by:

  • Giving over overview of cost over time to help you understand at a glance whether optimizations are needed
  • Highlighting where your costs come from–these can be viewed either at either a query or user level, including service users. This can help make your TCO more obvious.
  • Highlighting your least performant queries based on query runtime

These insights are designed to be actionable, and you can explore each insight more deeply from the drill-down view. From that view, you can:

  • Filter the data: Look at the queries generated by, for example, a single user or all of your service accounts. You can also change which time window you’re looking at, depending on if you’re looking to see larger trends or focus in on a specific moment in time.
  • Group the data: Group by type of query—such as a table creation statement—or specific tables and roles that you want to understand cost or performance for.
  • Aggregate the data: Sort queries to drill down into by total number of runs, credit usage, bytes, and other common performance metrics in your warehouse. These options provide a wide range of possibilities to understand and optimize your queries for both immediate gains as well as future incremental gains.

Query Explorer

Oftentimes there are insights you can gain by digging into the specific instances of each query, and you can easily do that from the Query Explorer. The Query Explorer shows all recent query runs for a particular query, including how much each query cost, how long it took to run, the warehouse and warehouse size it ran on, which user ran the query, and so on. From this view, you can also see daily trends over time in cost, runtime, and more.

Table-level data insights

As part of this update we’ve also overhauled the insights you can see about a particular table. Simply navigate to the “Data insights” tab for any table or view in Metaplane to see information about the query volume, the users and roles that most frequently query that table, as well as the slowest and most common queries run on that table.

How Metaplane Users Get the Most out of Data Insights

Metaplane customers are already seeing enormous value from the Data Insights dashboard:

  • One customer has used the query explorer to look at spillage to local disk for their dbt queries. This helped them determine which models may need to run on a bigger warehouse.
  • Another was surprised to find that their longest-running query took over 6 hours. By digging into what else was going on in their warehouse around that time, they were able to zero in on a queueing issue and fix it.
  • Customers using BigQuery as their warehouse were able to gain deeper insight into terabytes billed and slot usage for different types of queries, giving them deeper insights into the biggest drivers of spend.

We’re excited to hear how your team uses the Data Insights dashboard to optimize your own data stack!

More to Come

We’re just getting started with our Data Insights dashboard, and have many near-term updates planned, including an easy-to-use roll-up of what tables and views aren’t being used in your warehouse, and may be good candidates for cleanup.

If you have any feedback for other insights you’d like to see, please reach out via the in-app chat feature!

Getting started with Metaplane

If you already have a Metaplane account, you’ll be able to find data insights along the navigation bar on the left-hand side of your screen.

Otherwise, to get started with Metaplane, you can create an account or pick a time to learn more about data observability best practices from the team. New users can implement data observability within the hour, including these new data insights to improve query efficiency and optimize cost.

Table of contents

    Tags

    We’re hard at work helping you improve trust in your data in less time than ever. We promise to send a maximum of 1 update email per week.

    Your email
    No items found.
    Ensure trust in data

    Start monitoring your data in minutes.

    Connect your warehouse and start generating a baseline in less than 10 minutes. Start for free, no credit-card required.