MYSQL Database for DW Initiative

Gurus,
I am exploring a possibility of using DB2 I5 Series, MYSql or MS SQL Server to deploy a small DW (ODS). I expect the warehouse to be about a 1 or lesser that 1 TB. We are left w/ these three datatabases as the Customer has already some kind of relationship established w/ the RDBMS Vendors. The data model is mostly normalized and may use Star Schema.
We already have ETL and reporting BI tools in place.
Could you provide some of these details on MySQL database?
1. Scalability
2. Ability to handle data in third normal form for a DW initiative
3. Ability to handle complex join conditions
4. Interface w/ a web accessed reports5. Performance degradation/improvements in a simulteneous data access
6. Ease of deployment

I believe, we could find more about DB2 I5 series database and MS SQL Server.

If budget is a constraint, why don't u try MySQL Enterprise Subscription and
then build a DW, since u can get Enterprise Class support from MySQL direct
in case of any issues in planning or deployment.

Here's the situation, the customer was baught over by another company who have MYSQL database.
The direction that they are going is MYSQL but as a DW Consulting architect, I wanted to have some benchmarks on MYSQL.

I believe the customer has already licencese for Pentaho BI Suite.
I have seen this utility in past and does what it is supposed to do.

I have worked with mySQL as a backend database
for some web apps written in PHP and have
recently been investigating its use with, for
instance C# and ASP.NET for other web
apps. mySQL seems to work great with PHP;
however, I have had all manner of issue with
trying to get ODBC components for using mySQL
with Delphi, C#, and various other development languages.
With regard to scalability, I have not done any
studies on it but I would expect, based upon what
I know of mySQL, that it would scale well up to a
point but, frankly, I would be concerned about a
Tbyte of data. Also, there are some other facets
that would discourage me from using it . . .
essentially, there are some aspects of
implementation of SQL and database functionality
that are not quite as well developed in
mySQL. While mySQL should handle 3rd normal as
well as almost any, I would be concerned about
the complex joins and, in general, the
connections that would be needed to accomplishing
the web accessed reports along with a significant
number of simultaneous accesses.
Personally, I would recommend using SQL Server rather than mySQL or even DB2.
At 03:03 PM 12/15/2007, you wrote:
>
>[]
>
>
>
>Gurus,
>I am exploring a possibility of using DB2 I5
>Series, MYSql or MS SQL Server to deploy a small
>DW (ODS). I expect the warehouse to be about a 1
>or lesser that 1 TB. We are left w/ these three
>datatabases as the Customer has already some
>kind of relationship established w/ the RDBMS
>Vendors. The data model is mostly normalized and may use Star Schema.
>We already have ETL and reporting BI tools in place.
>Could you provide some of these details on MySQL database?
>1. Scalability
>2. Ability to handle data in third normal form for a DW initiative
>3. Ability to handle complex join conditions
>4. Interface w/ a web accessed reports5.
>Performance degradation/improvements in a simulteneous data access
>6. Ease of deployment
>
>I believe, we could find more about DB2 I5 series database and MS SQL Server.
>
>Thanks,
>Vinay Bagare

May I recommend you examine Postgres in detail too. We replaced our
oracle installation with it and have been more than satisfied. Its
totally open source and performs very well. No, I don't work for them
;-)
Tim Clarke

Microsoft SQL Server 2005 is a good choice. It has incredible features in DW & BI.

SQL Server 2005 makes it simpler and easier to deploy, manage, and optimize enterprise data and analytical applications. As an enterprise data management platform, it provides a single management console that enables data administrators anywhere in your organization to monitor, manage, and tune all of the databases and associated services across your enterprise. It provides an extensible management infrastructure that can be easily programmed by using SQL Management Objects, enabling users to customize and extend their management environment. SQL Server 2005 is a complete BI platform that provides the features, tools, and functionality to build both classic and innovative kinds of analytical applications.

I know about the database that I am currently working on (Teradata).
But I am unaware about requirements for hosting a DW on SQL server.
For about 1000 users and w/ less than .5 TB data, what specifications of SQL Server should I be looking at ex. CPU's, memory, OS, etc.
Could you give me some inputs on this please?

Copyright 1998-2015 Ziff Davis, LLC (Toolbox.com). All rights reserved. All product names are trademarks of their respective companies. Toolbox.com is not
affiliated with or endorsed by any company listed at this site.