Brainspace Discovery5

The industrys most advanced platform

for E-Discovery and Text Analytics.Brainspace Discovery 5 is the most comprehensive solution for analyzingunstructured data. From intelligent data preparation to exploration and collaboration,business professionals, analysts, and domain experts can use Discovery 5 to makedecisions, evaluate and mitigate risks, and uncover insights and opportunities.

VISUAL ANALYTICS

CONCEPT SEARCH

COMMUNICATION ANALYSIS

Built onBrainspace,a patented machineintelligence thatdynamicallyidentifies and relatesconcepts from anytext source.

A BOLD STEP FORWARD

Rather than creating just another semantic search solution, Brainspacechose to rethink the method from the ground up. This new approachhad to rise above the challenges and limitations that have hinderedthe other semantic search solutions on the market today Brainspacesemantic search had to be different. Applied first in e-Discovery,Brainspace Discovery had to learn from all of the documents loadedfrom a case every single one. It had to scale to provide semanticsearch on extremely large and often changing data volumes as areencountered in early case assessment. Most importantly, BrainspaceDiscovery had to answer our clients requests for completetransparency with control of each query it could not be a black box.Having met and exceeded the exacting standards in e-Discovery,Brainspace is now an industry-leader offering a fully transparent, highlyscalable, federated semantic search platform capable of handling theworlds largest unstructured datasets.

THE PERFECT COMBINATION OF MAN AND MACHINE

DOCUMENT CLASSIFICATION

Lets start with the machine. Brainspaces patented machine

intelligence dynamically identifies and relates concepts from any textsource and does so at a scale unmatched by any technology in theindustry.Now the important part User Experience. What makes BrainspaceDiscovery so unique is the reimagining of search as a conversationbetween the machine intelligence and the human user. To make thispossible Brainspace has developed several innovative, interactivevisualizations.

brainspace.com/discovery

BRAINSPACE DISCOVERY 5PRODUCT DATA SHEET

SCALABLE POWER WITH ULTIMATE CONTROL

TRANSPARENTNo black boxes here. Each query is enhanced by themachine intelligence and shown to the user for theircomplete understanding.FULLY FEDERATEDWe can make your existing database smarter. We canalso formulate semantic queries to run across virtuallyany index including internal document populations,intranets, extranets, enterprise content managementsystems, portals, email archives, and patent data.

COMPLETE USER CONTROL

Not only is our semantic search transparent, but usersare given the ability to require, ignore, increase ordecrease the importance of all query words in a uniquevisual query interface.HIGHLY SCALABLE SEMANTIC TECHNOLOGYUnlike other semantic technologies that must samplesmaller sets of documents for learning, our largestsemantic index currently includes learning fromhundreds of millions of documents

INTEGRATED CONNECTIONSINGEST DATA FROM COMMON PLATFORMSSeamlessly transfer data back and forth through fullintegrations to systems such as Relativity and Nuix.

IMPORT LOAD FILES

Using our simple import utility, you can load delimitedtext files such as DAT and CSV directly into Discovery.

The only analytics platform with a Brain.

BRAIN

BUILD

REDUCING NOISEDUPE AND NEAR-DUPE DETECTIONBrainspace Discovery detects and marks allduplicates and near-duplicates during ingestion bycomparing the body text of documents. This allowsBrainspace to identify documents with the sameor very similar content even if the native formats ofthe documents are different. With the duplicate andnear-duplicate documents in your dataset clearlyidentified and grouped together, you can focus youranalysis on unique documents without having tomanually review and identify duplicative information.Typical dedupe processes rely on meta-fields, butthe Brainspace methodology goes much further.

BOILERPLATE DETECTIONBrainspace Discovery automatically identifiesboilerplate text in your documents and ignoresit for clustering and learning. This ensures thatsemantically unrelated content does not distortthe intelligence that Discovery gains fromyour documents. Boilerplate detection occursautomatically, however Brainspace also providesthe ability to configure boilerplate detectionsettings, allowing you to specifically target text forinclusion or exclusion. This automated processsignificantly reduces the amount of operational timeneeded to get cases up and running.

brainspace.com/discovery

BRAINSPACE DISCOVERY 5PRODUCT DATA SHEETBRAINSPACE ANALYSISCLUSTERINGWe use a combination of hard clusteringalgorithms (where every document is assigned toa single cluster) and other propriety methods toproduce a binary cluster tree. We later reshapethis tree to a larger arity, also injecting syntheticclusters to create a cluster tree that better reflectsthe vocabulary found in the clusters.PATENTED PHRASE DETECTIONBrainspaces patented Automatic PhraseIdentification technology detects and createsphrase on the fly. Furthermore, it identifies wordcombinations with semantically similar context andcreates them as phrases. Brainspace achievesall this automatically, without relying on lexicons,synonym lists, thesauri, or phrase lists. This allowsfor the detection of unique phrases not in lists,therefore increasing accuracy and relevance.

PATENTED MULTI-CONCEPTOur patented Multi-Concept Detection technologyrecognizes and draws inferences from multipleconcepts. In environments with multiple brains,each concept can access a different brain to drawrelevant inferences. Unlike other solutions thatdepend on traditional Latent Semantic Analysis(LSA) that suffers from high recall and low precision,Brainspaces innovative approach that builds uponLSA with proprietary improvements significantlyincreases the accuracy without sacrificing recall.FOREIGN LANGUAGEBrainspace offers the ability to work with cases thatspan multiple languages. Brainspace is multi-lingualout of the box including CJK.PEOPLE AND ORGANIZATIONAL GROUPINGBrainspace extract and reconciles all domains,senders, recipient and custodians

BRAINSPACE IDBrainspace Discovery uniquely identifies eachdocument and ranks them based on their textualcontent, effectively grouping semanticallysimilar documents together. Using this powerfulidentification number, you can accelerate yourreview by smartly batching out documents basedon their semantic relevance.

EXACT DUPLICATE CLUSTER ID

Brainspace automatically clusters togetherdocuments that are exact duplicates of each otherand assigns a unique identifier to each exactduplicate cluster. This information can be exportedout in a report or to another platform, allowingyou to quickly identify documents that are exactduplicates of each other even when you are not inBrainspace.

EMAIL THREADINGUsing Discoverys Email Threading feature, you canincrease your review times by reviewing only themost inclusive email in a thread.NEAR DUPLICATE CLUSTER IDBrainspace automatically clusters togetherdocuments that are near duplicates of each otherand assigns a unique identifier to each nearduplicate cluster. This information can be exportedout in a report or to another platform, allowingyou to quickly identify documents that are nearduplicates of each other even when you are not inBrainspace.CLUSTER RELATED SET IDBrainspace automatically clusters together multipleexact duplicate and/or near duplicate clustersthat are lexically similar to each other. Each parentcluster gets assigned a unique identifier, which canbe exported out in a report or to another platform.This allows you to quickly identify documents thatare in the same parent cluster even when you arereviewing the data set outside of Brainspace.

VISUAL ANALYTICSTHE POWER OF A 360 VIEWBrainspace enables a truly unique Visual Analysis experience by Linking Multiple Views of Data in context including:Transparent Concept Search, timeline, clustering, communication analysis, and any structured data elements as well.DASHBOARDFrom a zero-state with no humandecision-making, the Dashboardpresents the essence of yourdocuments. Every element on thedashboard is interactive, so with just afew clicks you can greatly narrow thefield of investigation. Youll notice thatas searches are executed, documentson the far right match, and thedashboard itself changes to reflect thenarrowed population. The Dashboardmakes it easy to find meaning in eventhe largest datasets.

brainspace.com/discovery

BRAINSPACE DISCOVERY 5PRODUCT DATA SHEETDUPLICATE VIEWA visual representation of the totalvolume of data including duplicates,near duplicates, and originaldocuments. This view dynamicallyadapts to clicks, searches, and filters.

TIMELINE VIEWSee the volume of trends throughoutthe entire timespan of a datset,identify anomalies such as peaksand troughs, and drill into specificmonths or days for deeper analysis.

FACET TABLESBrainspace Discovery automaticallybuilds tables using available facetswithin your dataset and displays keyinformation such as top terms, topindividuals and top domains.

FOCUS WHEELBrainspace automatically groups documents in a wheel ofhierarchical clusters, each containing lexically connecteddocuments. Each document is placed into only one clustertogether with documents similar based on the meaning oftheir text. Each cluster is identified with the main topics thatconnect the underlying documents. The wheel provides youwith a birds-eye view of the data set and allows you tointeractively navigate and zoom into it to identify topicsof interest, which then can be expedited for review. In asimilar fashion you can identify areas of little interest thatcan be deprioritized.CREATE FOCUS WHEELS ON THE FLYYou can create a focus wheel on demand based onany group of documents resulting from a concept search,communication chain, or even another document cluster.With this powerful feature you can quickly visualize your dataand identify other topics of interest that you may not havethought about.COMPASS NAVIGATIONAfter finding an exemplar document, Brainspace can easily navigate to the cluster of interest. Just click onthe compass navigator and it will zoom to the document in the Focus Wheel. Zoom out to quickly identify thedocuments neighborhood. This is extremely powerful for investigation and prioritizing review.

TRANSPARENT CONCEPT SEARCH

Using Brainspace Concept Search functionality youcan use a word, phrase, or even a whole document torun searches against data sets in excess of hundredsof millions of documents to identify related terms.Brainspace achieves this by using patented technologythat transformed traditional Latent Semantic Analysis(LSA) including Automatic Phrase Detection and MultiConcept Detection.Brainspaces developed a patented method foridentifying multiple concepts in a corpus of text,extracting explicit and inferred terms from the Brain.Typically, semantic technologies average all concepts ina corpus - attempting to find a semantic center. This is asignificant deficiency and departure of how ideas

BRAINSPACE DISCOVERY 5PRODUCT DATA SHEET

TRANSPARENT CONCEPT SEARCH (cont.)

are understood and communicated. Brainspace multiconcept queries enable a more human-like learning fromany unstructured text, extracting multiple themes insteadof an artificial average.DEFENSIBLE CONCEPT SEARCHBrainspace has the capabilty to take any concept searchand turn it into a boolean expression. Unlike alternativeBlack Box concept search tools, this transparencytakes the guess work out of concept expansion anddelivers a versatile and defensible platform for attorneys.ADJUSTABLE TERM WEIGHTSEach query is expanded to a list of the most relevantinferred words and phrases (concepts). By usingthe slider under each term and phrase, the user caninfluence the intelligence, indicating importance,required terms or ignored terms, zeroing in on highlyrelevant results. Brainspace allows you to interactivelymanipulate the list of related terms and their weightsusing the search term sliders. The result is a transparentand defensible query using an explicit list of terms thatcan return exactly the documents of interest.

INTELLIGENTLY FILTER YOUR SEARCH RESULTS

You can quickly cull your search results by usinga combination of filters including email addresses,keywords, and domains.KEYWORD EXPANSION LISTAfter each concept search Brainspace returns a listof keywords with an associated relevance weighting.This feature allows a user to expand Keyword lists andensure they dont miss potential keywordsRELEVANCE RANKINGDiscovery ranks your search results by conceptualrelevancy based on your search criteria, allowing you toprioritize your analysis and review process.ADVANCED SEARCHUsing the Advanced Query Builder, automaticallyconstruct Lucene queries by combining any number offields (metadata and those created by Brainspace) withBoolean operators. By using the query editor you canmanually leverage powerful Lucene functions such aswildcard searches, fuzzy searches, proximity and rangesearches, term boosters and field groupings.

COMMUNICATION ANALYSISCommunication Analysis interface provides youwith an interactive map of all communication thattook place in your data set. Using CommunicationAnalysis, you can:

View people-level or domain-level map of all

communication traffic around your domain offocus.Immediately identify top sender, top recipients,and top terms.Analyze conversations between two or moreindividuals or domains including email trafficand topics discussed.Dynamically filter your view using a variety ofattributes including To, CC, BCC, direction oftraffic, and recipient count.Group to group communications- you can easilymonitor communications between regulatedand non-regulated divisions within yourcompany and ensure they are in compliance.

brainspace.com/discovery

BRAINSPACE DISCOVERY 5PRODUCT DATA SHEETACCLERATING REVIEWBRAINSPACE IDBrainspace Discovery uniquely identifies each documentand ranks them based on their textual content,effectively grouping semantically similar documentstogether. Using this powerful identification number,you can accelerate your review by smartly batch outdocument based on their semantic relevance.EMAIL THREADINGUsing Discoverys Email Threading feature, you canincrease your review times by reviewing only the mostinclusive email in a thread.PREDICTIVE CODINGBrainspace Discoverys Predictive Coding feature usesour patented machine learning technology togetherwith Logistic Regression to help you review less anddecrease your associated costs. Unlike other black-boxsolutions, Brainspace gives you more control by allowingyou to set your target recall at the beginning and allowyou to adjust it by providing you feedback on depth forrecall performance throughout the process. Furthermore,Brainspaces Active Learning methodology automaticallyidentifies and selects not only those documents thatthe classifier is most uncertain about, but also uncertaindocuments that are diversified across the collection,and provide insight into concentrations of interestingmaterial.

Brainspace offers predictive accuracy that is at

or close to the best across a range of publishedliterature and internal experimental results. [Beyondnaive Bayes (e.g. Autonomy) and decision trees.]Brainspace is quick to build its model and predictthe relevance of target documents. (e.g. quicker thannearest neighbor approaches like Content Analyst.)Brainspace provides prioritized ranking over thetarget documents by decreasing predicted degreeof responsiveness, not just binary partitioning.Brainspace provides reasonable predictions of theprobability that a document is relevant and a degreeof human interpretability; that allows, for instance, thesystem to highlight those parts of a document thathave led to a prediction of responsiveness.Unique Active Learning Methodology maximizesthe value of each round assigning reviewers thedocuments that have the best opportunity to train theclassifierBrainspace Depth of Recall Measure shows howmany documents must be reviewed to be done.

COLLECTIONSAny search, cluster, or document set can be taggedand placed into a collection. Collections can be namedon the fly or set up prior to a case. Like folders or tags,responsive documents within your data set can be easilycategorized, organized and shared with a click of themouse.DOCUMENT VIEWSHIT HIGHLIGHTINGAll relevant concepts that exist within the body text of adocument are highlighted. You can quickly navigate tothe highlighted portions to gain a deeper insight into thecontext the terms were used within.DOCUMENT TOPICSProminent topics that appear in a document areidentified and displayed, thus allowing you to get a quickunderstanding of a document without having to read it inits entirety.

DOCUMENT METADATAAll metadata that has been a part of the documentsince its creation as well as those that were created byDiscovery are displayed, providing a complete picture ofa documents history.

The only analytics platform with a Brain.

Discovery 5 is powered by Brainspace, the industrys most advanced, large-scalemachine learning platform. Brainspace rapidly ingests millions of pages of unstructured text,dynamically learning without taxonomies or ontologies. This learning is surfaced throughadvanced, interactive visualizations, giving the full power of Brainspace to every user.

DISCOVERYAPPLICATIONSERVER

DISCOVERYANALYTICSSERVER

Discovery 5 takes the large-scale

learning of Brainspace and puts itat the fingertips of users withbeautiful, interactive visualizations.Quickly and easily navigate largedatasets to uncover connectionsand learn more in less time.

TRANSPARENTCONCEPT SEARCH

VISUALANALYTICS

COMMUNICATIONANALYSIS

DOCUMENTCLASSIFICATION

Discovery 5 boasts the industrys

best and fastest documentclustering technology. Discoveryenables users to cluster documentresults on the fly, revealing newinsights in minutes (not hours).

Key Features of Discovery 5

BRAINSPACELEARNINGSERVER

APIMASSIVELY SCALABLE MACHINE LEARNING

The Brainspace Learning Server dynamically constructs

brains from billions of pages of unstructured content,automatically detecting and relating concepts, as wellas de-duplicating and clustering documents.

Designed with scaleablity in mind, Brainspace can

distribute its most data intensive processes acrossmultiple servers.

BEST-IN-CLASS CONCEPT SEARCH

Use a sentence, paragraph or page of text to retreiveconceptually related documents, ranked by relevanceor contextual distance.