BI Blogs

Bringing together Business Intelligence voices from across the web

OWB 10gR2 : Real Time Data Warehousing

Posted on the November 30th, 2005. Read times

Source: bayon blog [link]

There’s lots of talk about real time, right time, period batch, message based in the Data Warehousing and BI circles these days. I think this is driven by quite a few reasons. Need for fresh data, need for unified reporting interfaces for users, etc. Mostly, I think it comes down to a TCO for IT assets. As the EAI/EII/ETL tools start to converge along with increased SOA-ee-ness of databases and middleware products there becomes quite a bit of overlap between the different product sets. Managing “one product” that does this data integration, calcuation, and movement between systems costs less to maintain than “multiple products.” Truthfully, I see little strategic (ie, warehouse and marts) data that needs to be computed in real time. Those cases do exist, and OWB 10gR2 has some new features for those that do have some Real Time DW/BI needs.

There are two major flavors of mappings in support of Real Time Data Warehousing in OWB:

  • PERIODIC BATCH: This is basically a batch process that runs frequently (say every minute or so) that reads data from a QUEUE or STREAM. While the data is pushed into the DW (real time), the system only processes when run (batch). These are regular mappings that use a Stream or Queue as a source instead traditional Tables/Views/etc.
  • TRICKLE FEED: This is much closer to what most people think of when we refer to real time data warehouse. Trickle feeds involve processing each individual record as it arrives, instead of waiting for them to collect. These are a special kind of OWB mapping called Real Time Mappings that run continuously and process records as they arrive.

Truthfully I’ve only kicked the tires with both of these types of mappings limitedly. I tested some of the features back in OWB Beta2 and built a conceptual mockup of how it would work for a customer of mine. What I’m presenting is a conceptual partially working mock up built using an early beta release. In other words, do not use it as reference or consider it a blueprint for how you should proceed. If there is enough interest I might submit an article to OTN on the subject. Anyone like the idea? Better yet, if you’re not one of my customers please do consider contacting me! I’d love to help build a Real Time DW solution with OWB!

OWB now includes the ability to define, deploy, and setup Streams, Queues, Queue Tables, UserDefinedTypes, and propogations within the GUI. There’s a whole set of screens that you’ll see when the community preview hits the shelves. Unlike regular OWB deployments there are some additional requirements around streams administration locations, permissions, etc, but they are easily surmountable. Also, if you’re going to be doing real time DW you need to understand a bit about the underlying technology anyhow (not tons, but enough to know why you need to have Archive Logging turned on, etc).

Refer to the following PDF for some greater details on the conceptual, but here’s a not so good screenshot:

I’ve created a mockup of a BI solution that is fed by a CRM (Customer Data Hub perhaps) and a Subscription Management Application for this example. You can see that conceptually this involves both systems sending messages either from the APPLICATION LEVEL (JMS or some other messaging technology) or the DATABASE LEVEL (with DML Stream Captures running in Oracle). In other words, we have multiple places we can get different pieces of data and the application doesn’t necessarily have to be “REAL TIME ENABLED” to send real time data. Oracle can do that on it’s behalf using the Streams technology!

Overall what this looks like is we setup the various Streams, Capture Processes (DML), Queue Tables, and Types (based on our source tables) to support our real time system. Note that the screenshot does not include the Streams on the source system or the Capture Process definitions. This only includes the DW side Streams, Queues, Dimensions, etc.

I’ve built three real time mappings (TRICKLE FEED) which in concert receive messages to add Dimension records (SCD2) and insert new Cube records (transactions). Notice this is a greatly simplified example entirely ignoring what I consider a best practice of loading into a normalized warehouse, then updating marts based on the warehouse (a la CIF methodology). Also these are all assuming to changes (ie, record corrections) just straight clean data! We should all be so lucky!

One receives updates from the CRM application and performs SCD on the appropriate Dimension objects.

The others receives event messages from a transaction based system and inserts records into Cubes.

This isn’t quite as much detail as I would like to have gone into, and I’ll quickly repeat my warning… This is just some mockups and conceptual work so don’t expect it to be accurate come OWB 10gR2 production time! I have some more thoughts on how to use this with Partition Exchange Loading to get a days “Cubes” built realtime throughout the day, and then at the end of the day move them over to the full history but that’s a whole nother article.

This blog is part of the OWB Paris Early Review series which reviews and comments on several new Paris features.

Update, Awards and Articles

Posted on the November 30th, 2005. Read times

Source: Mark Rittman's Oracle Weblog [link]

Time for a quick update.

If you read my posting a couple of weeks ago, you’ll have seen that I’ve been
working away on a few articles recently, hence the lack of updates. It’s all a
bit embarrasing actually as the Oracle Magazine ACE of the Year 2005 article was
recently published, and I also just got an email through from SearchCRM.com to
say I’d got their "BI Blog of the Year 2005" award, and if you take a look at
the site there’s been precious few updates to the blog now for several weeks. So
if you’re a new reader - it’s not always like this, I’m just a bit busy at the
moment…

The main thing that I’ve been working on (apart from proper work, of course)
is a couple of articles for OTN and Oracle Magazine. The OTN one is on the data
profiling and cleansing features in OWB10gR2 "Paris" and is due for publication
when the product comes out, whilst the Oracle Magazine one is on the DML Error
Logging feature that’s in Oracle 10.2 and should be out in the first half of
2006. Both technologies are going to have a big impact on how we do ETL going on
from now; the data profiling and cleansing feature in Paris allows you to
automatically profile your data, derive data rules and then auto-generate
mappings to correct your data, avoiding the need to write loads of SQL*Plus
scripts, put lots of spreadsheets together and work out your own mappings. The
DML Error Logging feature in Oracle 10.2 means that you can direct-path insert
data into tables and not worry about the data load erroring and rolling back -
the error logging feature just moves the "dirty" rows into a logging table, with
details of the error, and lets the load continue. All good stuff and you should
be able to read the articles next year.

I’ve been working with the UKOUG BIRT SIG committee on the

next SIG meeting in January
and it’s shaping up to be a good one. The last
few meetings have been quite business and tools focussed and so for the January
session, we’re going to have a bit more of a data warehousing and methodology focus to the meeting. So far we’ve got the
following speakers lined up:

  • Jeff Moss, on Five Tips
    for Tuning your Data Warehouse,
  • Kevin Lancaster from Oracle EMEA on Oracle OLAP Tips & Techniques
  • myself, on

    XML Publisher
    - a talk and a demo
  • Donna Kelly,
    on Running a Successful OWB Project
  • David Walker, on Process Neutral
    Data Modelling

Not a bad line up if I say so myself. If you’re a UKOUG member book up soon
as I think we’re going to sell out this time round.

Now I’ve finished the two Oracle articles I’ve got a bit more free time,
although I should really be starting my sample chapter for the book I’m
collaborating on. I’m doing the talk on XML Publisher at the next BIRT SIG so
I’ll probably make a start on working out how that works, and I’ll try and post
some updates and findings on the blog. I’ve also got the article I promised to
write on using OLAP_TABLE with analytic workspaces, it’s a good one actually so
I’ll try and wrap it up next week. Until then, bye for now.

Chris Webb’s BI Blog: Podcast #1: Nigel Pendse

Posted on the November 30th, 2005. Read times

Source: OLAP/BI/IM stuff [link]

Chris Webb’s BI Blog: Podcast #1: Nigel Pendse: “A while ago I decided that it would be cool to jump on the podcasting bandwagon, and here’s my first attempt. I was very lucky to get Nigel Pendse to chat with me about the impact that SQL2005/AS2005 will have on the BI market as a whole; if you don’t know who Nigel is he’s the man behind the OLAP Report and the OLAP Survey, both of which are great sources of competitive intelligence if you’re a software vendor or consultancy, and also a very popular speaker at conferences and seminars. He has a lot of interesting and intelligent observations to make in my opinion.”

Windows Virtualization Architecture - from Only4Gurus.com

Posted on the November 30th, 2005. Read times

Source: OLAP/BI/IM stuff [link]

Virtualization is the greatest thing since sliced bread. Learn it, set it up in your bedroom, whatever, but get on that train.

Windows Virtualization Architecture - from Only4Gurus.com: “Summary: This presentation will cover the following topics: Current virtualization uses and benefits (Uses for virtualization today and Microsoft’s current virtualization offerings); Windows Virtualization Architecture (Hypervisor, Virtualization stack, Device virtualization).”

The SQL Server 2005 Coattail Effect: TDWI

Posted on the November 30th, 2005. Read times

Source: OLAP/BI/IM stuff [link]

The SQL Server 2005 Coattail Effect: TDWI: “Call it a case of both technologies riding in on the coattails of Microsoft?s SQL Server 2005 juggernaut. Deepak Puri, a BI professional with a prominent U.S. auto insurance company, says his employer (which is nominally a mixed DB2 and SQL Server 2000 shop) has been doing more on the SQL Server side of the aisle lately. To some degree, Puri attributes this to the attractiveness of SQL Server?s all-in-one BI stack. ?Our plans for SQL Server 2005 vary, [but] we’re probably looking at the BI components and XML support more aggressively than the other pieces,? comments Puri, who says his employer plans to roll out SQL Server 2005 IS, Analysis Services, and Reporting Services next year.”

Kirk Haselden : Feature Package for SQL Server 2005

Posted on the November 30th, 2005. Read times

Source: OLAP/BI/IM stuff [link]

Kirk Haselden : Feature Package for SQL Server 2005: “Microsoft has released a Feature Pack for SQL Server 2005 that is a collection of very useful features including a DB2 OLEDB provider, a MOM package for SS2K5, SS2KDTS Designer components so that you can edit legacy packages inside Integration Services, the JDBC driver and the Management Objects collection which includes important object models such as SMO.”

SimonS’ SQL Server Stuff : SQL Server Fortnightly Briefings

Posted on the November 30th, 2005. Read times

Source: OLAP/BI/IM stuff [link]

SimonS’ SQL Server Stuff : SQL Server Fortnightly Briefings: “These are intended as being an overview of whats happening in SQL Server land, I think its a great idea. Whats more is they are recorded and that allows you to what at maybe a more convenient time. You can also what in a shorter period by upping the speed in Windows Media Player, and the sound is perfectly audible. Sounds like Tony’s been on the helium.”

Using SQL Server 2005 sqlcmd Utility

Posted on the November 30th, 2005. Read times

Source: OLAP/BI/IM stuff [link]

Using SQL Server 2005 sqlcmd Utility: “SQL Server 2005 provides some new command line utilities. One such utility is ’sqlcmd’. The sqlcmd utility is used to run adhoc queries interactively from a command prompt window, or can be used to execute a script containing T-SQL statements. The sqlcmd utility is a great improvement over osql and isql of older releases of SQL Server. In this article, I will explain some of the features this new command line utility brings to administering SQL Server.”

The SQL Server 2005 Coattail Effect: TDWI

Posted on the November 30th, 2005. Read times

Source: OLAP/BI/IM stuff [link]

Insightful article… sometimes the focus on BI in SQL 2005 has reduced the focus on SSIS, a legitimate ETL package that comes with SQL 2005. It will take a couple of years, but SQL is going to cause some big waves in a number of areas.

0 Comments

Podcast #1: Nigel Pendse

Posted on the November 30th, 2005. Read times

Source: Chris Webb's BI Blog [link]

A while ago I decided that it would be cool to jump on the podcasting bandwagon, and here’s my first attempt. I was very lucky to get Nigel Pendse to chat with me about the impact that SQL2005/AS2005 will have on the BI market as a whole; if you don’t know who Nigel is he’s the man behind the OLAP Report and the OLAP Survey, both of which are great sources of competitive intelligence if you’re a software vendor or consultancy, and also a very popular speaker at conferences and seminars. He has a lot of interesting and intelligent observations to make in my opinion.

 

Anyway, here’s where to get the podcast (it’s about 25 minutes long and in mp3 form):

http://www.ourmedia.org/node/108782

Apologies for the rather amateurish production…

 

If you think this is a worthwhile undertaking, I’ll start thinking of people to ask to interview for future podcasts. Let me know what you think!

 

Actuate Updates Financial Performance Management (FPM)

Posted on the November 30th, 2005. Read times

Source: BI this week [link]

With its better-than-spreadsheet-spreadsheet marketing strategy, Actuate could find an eager audience for its FPM tool.

The SQL Server 2005 Coattail Effect

Posted on the November 30th, 2005. Read times

Source: BI this week [link]

Will BPM and ETL ride to mainstream success on the coattails of Microsoft?s SQL Server 2005 juggernaut?

Firstlogic and Pitney Bowes: The Data Quality Dynamic Duo That Almost Was

Posted on the November 30th, 2005. Read times

Source: BI this week [link]

What really happened?and where does Firstlogic go from here?

Restoring your Data 101

Posted on the November 29th, 2005. Read times

Source: OLAP/BI/IM stuff [link]

Restoring your Data 101: “Now that your data is backed up, you will need to learn how to restore your data when it is applicable. You might want to restore your data because of a hardware failure or restore data to a backup or standby server. The reasons are endless but the most important thing is to be prepared.”

CEO - Managing: Why Projects So Often Go Wrong - FORTUNE - Page

Posted on the November 29th, 2005. Read times

Source: OLAP/BI/IM stuff [link]

CEO - Managing: Why Projects So Often Go Wrong - FORTUNE - Page: “It’s been 30 years since you wrote your book. Is it still relevant?

People still buy copies to give to their boss. The book is really more about management than about technology. The technology has changed immensely, so some of the old chapters are totally out of sync. On the other hand, people haven’t changed much. That’s why Homer and Shakespeare and the Bible are still relevant, because they’re all dealing with human nature. I think that’s part of the explanation for this book: The problems of managing people in teams have not changed, though the medium in which people are designing and the tools they are using have. Some people have called the book the ‘bible of software engineering.’ I would agree with that in one respect: that is, everybody quotes it, some people read it, and a few people go by it. “

My Most Recent Business Intelligence Experiences

Posted on the November 29th, 2005. Read times

Source: OLAP/BI/IM stuff [link]

My Most Recent Business Intelligence Experiences: “Business intelligence organizations within telecom service providers will have a chance to provide a great deal of value to the industry?s changing business plans. Although there are potential pitfalls with restructuring, the rewards for business intelligence more than make up for those risks. “

Dashboards: The Key to Breaking the Dependency on BI

Posted on the November 29th, 2005. Read times

Source: OLAP/BI/IM stuff [link]

Dashboards: The Key to Breaking the Dependency on BI: “Dashboards are a sophisticated way of presenting vital information in an ‘at-a-glance’ graphical format, always with the option of drilling down into the details of the data. From a C-level perspective, dashboards are just what the doctor ordered: relevant, self-selected information that can be used to manage more effectively.”

Whitepaper: Improving the User Interface to SAP & BW by Integrating Data Into Excel

Posted on the November 29th, 2005. Read times

Source: ITPapers.com - Recent Business Intelligence / Data Warehousing White Papers [link]

Many organisations spend considerable time and effort in the implementation of the ERP system SAP R/3 and its Business Intelligence tool BW. The implementation of SAP and BW is often used to drive process improvements and efficiencies throughout the organisation. The reality is that the majority of organizations are not getting the most out of SAP and BW and are not maximising their ROI because of the time and effort it normally takes to get data into and out of SAP and BW. Microsoft Excel is a widely used and distributed business support tool - it is often seen as the tool of choice to manipulate and manage data.

Aggregates Location: A Rule of Thumb

Posted on the November 29th, 2005. Read times

Source: ITPapers.com - Recent Business Intelligence / Data Warehousing White Papers [link]

Creating business driven database level aggregates enables a certain level of client / toolset independence, data storage and protection within a specialized RDBMS and lower overall Total Cost of Ownership. Without minimizing the role of the client systems in a Data Warehousing environment, it is quite obvious that the first candidate for aggregates deployment is the Data Warehouse Database, regardless of the type of client system that will ultimately present the data to the end-user.

Designing a Corporate Information Factory Using the Zachman Architecture Framework

Posted on the November 29th, 2005. Read times

Source: ITPapers.com - Recent Business Intelligence / Data Warehousing White Papers [link]

This paper presents a method of applying the order defined by Zachman’s Framework to build the most complex components of the Corporate Information Factory. The same method can be applied for the structured definition of other systems that can be part of the CIF, like Data Marts, Metadata Repository, MOLAP cubes and so on. There are, of course, other methods and methodologies to define a Corporate Information Factory. But the Zachman’s Framework implementation was successfully tested by Open Data Systems’ consultants and proved to shorten the time and project costs with more then 10% over other methodologies.

Next Page »