Aggregation splits data into subsets, computes summary statistics on each subset, and reports the results in a conveniently summarized form. The aggregate function is one of the most capable functions in the scidb package. The package overloads R’s standard...

How to Install SciDB Watch the video to get step-by-step instructions for installing SciDB on one or more servers. Accompanying slide deck for installing SciDB: HowToInstallSciDB slides Don’t have SciDB installed yet? Click here to get it...

This post is a continuation of the previous post, Windowing Operations over Timeseries Data in Paradigm4. Repeated from part 1 of the post:The SciDB array, showing elevation and time as dimensions Now, let’s look at how this array could be represented in SciDB, as...

Windowing operations—aggregating functions over a rolling subset of data—are useful in many applications. For example, rolling average calculations can help smooth over short-term fluctuations, thereby revealing long-term trends. Sensors used for testing global...

Some customers have inquired about loading files from HDFS. Current versions of SciDB support this. Remember, HDFS is not a specific file format, it is a file system that uses distributed storage. HDFS can store CSV files. Although HDFS files are not directly visible...

We get asked about hardware configuration for SciDB clusters. We start with a rule of thumb for the ratios among SciDB instances, CPU cores, RAM, and disks. That rule of thumb is that each SciDB instance should have: 1-2 CPU cores :: 4-8 GB RAM :: 1 Disk. For example,...

About Paradigm4

We are the company behind SciDB. Paradigm4's SciDB is designed for mining insights from genomic, clinical, trading, image, sensor, and device data. Paradigm4 is changing what’s possible with Big Data analytics, enabling businesses to answer bigger—and harder—questions, accelerating the creation of new discoveries, products and services.