Saturday, December 20, 2025
HomeTechnologyThe Trillion Greenback Downside – O’Reilly

The Trillion Greenback Downside – O’Reilly

Image this: You’re a knowledge analyst on day one at a midsize SaaS firm. You’ve received the beginnings of a knowledge warehouse—some structured, usable knowledge and loads of uncooked knowledge you’re not fairly positive what to do with but. However that’s not the actual downside. The true downside is that completely different groups are doing their very own factor: Finance has Energy BI fashions loaded with customized DAX and Excel connections. Gross sales is utilizing Tableau linked to the central knowledge lake. Advertising has some bespoke answer you haven’t found out but. Should you’ve labored in knowledge for any variety of years, this scene in all probability feels acquainted.

Then a finance director emails: Why does ARR present as $250M in my dashboard when Gross sales simply reported $275M of their name?

No downside, you assume. You’re a knowledge analyst; that is what you do. You begin digging. What you discover isn’t a easy calculation error. Finance and gross sales are utilizing completely different date dimensions, in order that they’re measuring completely different time durations. Their definitions of what counts as “income” don’t match. Their enterprise unit hierarchies are constructed on utterly completely different logic: one buried in a Energy BI mannequin, the opposite hardcoded in a Tableau calculation. You hint the issue by layers of customized notebooks, dashboard formulation, and Excel workbooks and understand that making a single model of the reality that’s governable, secure, and maintainable isn’t going to be straightforward. It may not even be doable with out rebuilding half the corporate’s knowledge infrastructure and reaching a stage of compliance from different knowledge customers that will be a full-time job in itself.

That is the place the semantic layer is available in—what VentureBeat has referred to as the “$1 trillion AI downside.” Consider it as a common translator to your knowledge: It’s a single place the place you outline what your metrics imply, how they’re calculated, and who can entry them. The semantic layer is software program that sits between your knowledge sources and your analytics instruments, pulling in knowledge from wherever it lives, including vital enterprise context (relationships, calculations, descriptions), and serving it to any downstream device in a constant format. The outcome? Safe, performant entry that allows genuinely sensible self-service analytics.

Why does this matter now? As we’ll see once we return to the ARR downside, one pressure is driving the urgency: AI.

Legacy BI instruments have been by no means constructed with AI in thoughts, creating two vital gaps. First, all of the logic and calculations scattered throughout your Energy BI fashions, Tableau workbooks, and Excel spreadsheets aren’t accessible to AI instruments in any significant manner. Second, the info itself lacks the enterprise context AI wants to make use of it precisely. An LLM uncooked database tables doesn’t know that “income” means various things to finance and gross sales, or why sure data ought to be excluded from ARR calculations.

The semantic layer solves each issues. It makes knowledge extra reliable throughout conventional BI instruments like Tableau, Energy BI, and Excel whereas additionally giving AI instruments the context they should work precisely. Preliminary analysis reveals close to 100% accuracy throughout a variety of queries when pairing a semantic layer with an LLM, in comparison with a lot decrease efficiency when connecting AI instantly to an information warehouse.

So how does this truly work? Let’s return to the ARR dilemma.

The core downside: a number of variations of the reality. Gross sales has one definition of ARR; finance has one other. Analysts caught within the center spend days investigating, solely to finish up with “it relies upon” as their reply. Determination making grinds to a halt as a result of nobody is aware of which quantity to belief.

That is the place the semantic layer delivers its greatest worth: a single supply for outlining and storing metrics. Consider it because the authoritative dictionary to your firm’s knowledge. ARR will get one definition, one calculation, one supply of reality all saved within the semantic layer and accessible to everybody who wants it.

You is likely to be pondering, “Can’t I do that in my knowledge warehouse or BI device?” Technically, sure. However right here’s what makes semantic layers completely different: modularity and context.

When you outline ARR within the semantic layer it turns into a modular, reusable object—any device that connects to it may well use that metric: Tableau, Energy BI, Excel, your new AI chatbot, no matter. The metric carries its enterprise context with it: what it means, the way it’s calculated, who can entry it, and why sure data are included or excluded. You’re not rebuilding the logic in every device; you’re referencing a single, ruled definition.

This creates three quick wins:

  • Single model of reality: Everybody makes use of the identical ARR calculation, whether or not they’re in finance or gross sales, or they’re pulling it right into a machine studying mannequin.
  • Easy lineage: You’ll be able to hint precisely the place ARR is used throughout your group and see its full calculation path.
  • Change administration that really works: When your CFO decides subsequent quarter that ARR ought to exclude trial clients, you replace the definition as soon as within the semantic layer. Each dashboard, report, and AI device that makes use of ARR will get the replace routinely. No searching by dozens of Tableau workbooks, Energy BI fashions, and Python notebooks to seek out each hardcoded calculation.

Which brings us to the second key perform of a semantic layer: interoperability.

Again to our finance director and that ARR query. With a semantic layer in place, right here’s what modifications. She opens Excel and pulls ARR instantly from the semantic layer: $265M. The gross sales VP opens his Tableau dashboard, connects to the identical semantic layer, and sees $265M. Your organization’s new AI chatbot? Somebody asks, “What’s our Q3 ARR?” and it queries the semantic layer: $265M. Similar metric, similar calculation, similar reply, whatever the device.

That is what makes semantic layers transformative. They sit between your knowledge sources and each device that should eat that knowledge. Energy BI, Tableau, Excel, Python notebooks, LLMs, the semantic layer doesn’t care. You outline the metric as soon as, and each device can entry it by normal APIs or protocols. No rebuilding the logic in DAX for Energy BI, then once more in Tableau’s calculation language, then once more in Excel formulation, then once more to your AI chatbot.

Earlier than semantic layers, interoperability meant compromise. You’d choose one device because the “supply of reality” and pressure everybody to make use of it, otherwise you’d settle for that completely different groups would have barely completely different numbers. Neither possibility scales. With a semantic layer, your finance crew retains Excel, your gross sales crew retains Tableau, your knowledge scientists maintain Python, and your executives can ask questions in plain English to an AI assistant. All of them get the identical reply as a result of they’re all pulling from the identical ruled definition.

Again to day one. You’re nonetheless a knowledge analyst at that SaaS firm, however this time there’s a semantic layer in place.

The finance director emails, however the query is completely different: “Can we replace ARR to incorporate our new enterprise unit?”

With out a semantic layer, this request means days of labor: updating Energy BI fashions, Tableau dashboards, Excel reviews, and AI integrations one after the other. Coordinating with different analysts to know their implementations. Testing all the things. Hoping nothing breaks.

With a semantic layer? You log in to your semantic layer software program and see the ARR definition: the calculation, the supply tables, each device utilizing it. You replace the logic as soon as to incorporate the brand new enterprise unit. Take a look at it. Deploy it. Each downstream device—Energy BI, Tableau, Excel, the AI chatbot—immediately displays the change.

What used to take days now takes hours. What used to require cautious coordination throughout groups now occurs in a single place. The finance director will get her reply, gross sales sees the identical quantity, and no person’s reconciling spreadsheets at 5pm on Friday.

That is what analytics might be: constant, versatile, and truly self-service. However getting there requires rethinking how we architect knowledge methods. Within the subsequent article, “Evolving the Medallion: Information Structure within the Period of Semantics,” we’ll discover how semantic layers change the best way we take into consideration knowledge structure.

RELATED ARTICLES

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Most Popular

Recent Comments