dtSearch Release Notes

Disclaimer: This information is provided subject to the license agreement accompanying the
products described (if acquired with the product) or to the terms of use of this web site, and
does not constitute an additional warranty. dtSearch makes no warranty of any kind with regard to this information.

Upgrade Information

dtSearch indexes created by older versions from 6.0 forward can be used by newer versions.
Therefore, you will still be able to use your existing indexes after upgrading to a newer version.
However, file parser changes between versions may cause hit highlighting to appear incorrectly in retrieved documents
when a different dtSearch version is used to search an index than was used to create the index. To correct this, rebuild
the index with the current dtSearch version.

dtSearch 7.79 (Build 8232) Released February 20, 2015

All products

Fixes and minor enhancements

Fixed bug affecting cancellation of file conversion, either after FileConverter.TimeoutSeconds expired
or when OutputStringMaxSize was exceeded, when processing
large binary input files with the dtsConvertInlineContainer flag.

File parser bug fixes affecting: .xlsx, .pdf, .doc, .msg

Added FileConverter.SetIndexCache() API to allow an IndexCache to be used with file conversion.

Other bug fixes.

dtSearch 7.78 (Build 8215) Released October 29, 2014

Fixes and minor enhancements

Fixed bug affecting cancellation of file conversion, either after FileConverter.TimeoutSeconds expired
or when OutputStringMaxSize was exceeded, when processing
large binary input files with the dtsConvertInlineContainer flag.

Added support for indexing Outlook 2013 and 2010 OST files. Note: Microsoft has not officially documented the OST file format
specification, so this support is based on unofficial non-Microsoft information about the OST file format.

dtSearch 7.74 Build 8166 Released December 30, 2013

All products

dtSearch Desktop/Network

Added support for indexing Outlook emails and other content using 64-bit versions of Microsoft Office. A 64-bit version of mapitool is also included.

dtSearch Web/Publish

Updated search form templates for dtSearch Web and dtSearch Publish. A new drop-down list in the "Build Search Form" dialog box lets you pick the template to use. The updated templates include
frameless options and new HTML5 elements such as the calendar control for date searching. Some examples of the new search
forms are posted here.

dtSearch 7.73 Build 8139 Released October 7, 2013

Fixed browser compatibility bug, present only in dtSearch Desktop 7.73.8138, that caused problems with the "Next Hit" button on some systems.

dtSearch 7.73 Build 8138 Released September 30, 2013

All products

Support for some older Windows versions is discontinued in dtSearch 7.73. Supported: Windows Server 2012, Windows Server 2008, Windows Server 2003 SP 2, Windows 8, Windows 7, Windows Vista, Windows XP SP 3.
Not supported: Windows 2000, Windows ME, Windows 98, Windows 95, and Windows XP versions without SP 3.
For more information on operating system compatibility please see this Microsoft article: http://msdn.microsoft.com/en-us/library/vstudio/jj851139.aspx
This change is a result of our transition to Visual Studio 2012, using the v110_xp platform toolset, to build all Windows products.
The older .NET API wrappers (dtSearchNetApi2.dll and dtSearchNetApi3.dll) are still built using older compilers for compatibility.

dtSearch Web/Publish

Note: Updated search form templates for dtSearch Web and dtSearch Publish, previously included in dtSearch 7.73 beta builds, have been deferred to version 7.74 to allow time for additional browser compatibility testing.

dtSearch Engine

Added dtsConvertUseStyles flag in the
ConvertFlags enumeration, to provide a way to use CSS styles to format content.

Added FileConverter.DocTypeTag, to provide a way to specify a DocType in HTML output.

Added detection of Windows PE and NE executables and Linux ELF executables (these formats are still indexed according to the
binary files setting, with content either filtered or skipped)
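
As an illustration of how executable formats can be recognized, the sketch below checks the publicly documented signatures: ELF files begin with the bytes 0x7F "ELF", while PE and NE files begin with an MS-DOS "MZ" header whose field at offset 0x3C points to a "PE" or "NE" signature. This is not the dtSearch implementation; the function names detect_executable and make_stub are hypothetical.

```python
import struct
from typing import Optional


def detect_executable(data: bytes) -> Optional[str]:
    """Return 'ELF', 'PE', 'NE', or None based on well-known signatures."""
    # Linux ELF files start with the 4 bytes 0x7F 'E' 'L' 'F'.
    if data[:4] == b"\x7fELF":
        return "ELF"
    # Windows PE and NE files start with an MS-DOS "MZ" header.
    if data[:2] != b"MZ" or len(data) < 0x40:
        return None
    # The 32-bit little-endian value at offset 0x3C locates the new header.
    (offset,) = struct.unpack_from("<I", data, 0x3C)
    signature = data[offset:offset + 4]
    if signature == b"PE\x00\x00":
        return "PE"
    if signature[:2] == b"NE":
        return "NE"
    return None


def make_stub(signature: bytes) -> bytes:
    """Build a minimal fake MZ file for demonstration (not a real executable)."""
    header = bytearray(0x40)
    header[:2] = b"MZ"
    struct.pack_into("<I", header, 0x3C, 0x40)  # new header follows directly
    return bytes(header) + signature


print(detect_executable(make_stub(b"PE\x00\x00")))
```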

Fixed bug causing use of dtsExoDoNotConvertAttachments in FileConverter.ExtractionOptions to generate an incorrect "File Encrypted" error for
some documents during file conversion (not indexing or searching).

Fixed bug causing email headers to be indexed even when the dtsoSkipEmailHeaders flag is set, if filetype.xml is set up to index message bodies separately from attachments.

Fixed bug affecting nested proximity searches where a matching set of the terms exists exactly once and
is preceded (by the exact proximity range) by two or more instances of the proximity terms that
do not satisfy the proximity criteria.

Other bug fixes.

dtSearch 7.71 Build 8080 Released November 30, 2012

dtSearch Engine

New support for highlighting hits using different colors for each search term when highlighting hits
using FileConverter.
For API documentation on this feature, please see the article "Highlighting each term using different attributes"
in dtSearchApiRef.chm. Multicolor hit highlighting is not supported for PDF files displayed in Adobe Reader.

Added new options in dtSearch Desktop to (1) hide MIME headers in emails, (2) show properties of images embedded in documents,
and (3) control whether paths are indexed
along with filenames when the "Index filenames as text" option is enabled. These options are in the Options > Preferences > Indexing Options
dialog box.

New single-document option for indexing Access (*.mdb, *.accdb), XBase (*.dbf), and Comma-separated values (*.csv) files.
By default, dtSearch indexes each record of database files (*.mdb, *.accdb, *.csv, *.dbf) as a separate document.
This new option provides a way to index all records in a database file as a single document.
For more information, see dtSearchApiRef.chm (Overviews > Databases and Fields > Database files (*.mdb, *.dbf, *.csv))
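
The difference between the two modes can be pictured with a small sketch that turns a CSV file either into one document per record or into a single combined document. This only illustrates the concept using Python's standard csv module; the function name csv_as_documents and the "field: value" text layout are assumptions, not the dtSearch output format.

```python
import csv
import io
from typing import List


def csv_as_documents(text: str, single_document: bool = False) -> List[str]:
    """Return the indexable text units for a CSV file."""
    rows = list(csv.DictReader(io.StringIO(text)))
    # Render each record as "field: value" lines (an assumed layout).
    records = ["\n".join(f"{k}: {v}" for k, v in row.items()) for row in rows]
    if single_document:
        # All records merged into one document.
        return ["\n\n".join(records)]
    # Default behavior described above: one document per record.
    return records


sample = "name,city\nAnn,Oslo\nBob,Rome\n"
print(len(csv_as_documents(sample)))
print(len(csv_as_documents(sample, single_document=True)))
```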

Added dtsoFfShowImageProperties flag in Options.FieldFlags to display image properties (such as EXIF data) for images embedded in documents. Image properties
are always indexed for images in separate files. This flag only affects images embedded in documents, such as a .jpg embedded in a Word file.
A related change, made for consistency, affects the handling of image files embedded in .eml email files. Previously these properties
were always extracted. Now they will only be extracted when the dtsoFfShowImageProperties flag is set, so .eml files will be handled
consistently with other file formats.

Fixes and minor enhancements

Eliminated use of FILE_FLAG_RANDOM_ACCESS, which could cause excessive memory consumption under Windows Server 2008 because of what appears to be a
bug in Windows caching behavior (see http://support.microsoft.com/kb/2549369 for
more information).

Zlib version updated to 1.2.7

dtSearch.Spider2.dll and dtSearch.Spider4.dll have new dependencies on zlib DLLs zlib_wapidll_{VC8/VC10}_{32/64}.dll to handle
gzipped sitemap.xml files.

Added file parsers for Ichitaro word processor versions 5 and later.

File parser bug fixes affecting: MSG, PDF, DOCX, PPTX, Excel 2, RTF

Message attachments to MIME emails are now indexed as attachments (so they can be handled consistently with other attachments
in the new attachment-related features described above) rather than being merged with the text of the message.

Added reporting of PDF files that do not contain any page text. In dtSearch Desktop, these will appear in the
index log with "Image Only" after the type name (click View Log in the Update Index dialog box to see the
log of indexed files). In the API, the flag fiImageOnly will be set in IndexFileInfo (.NET, Java) or
dtsIndexProgressInfo.fileInfoFlags (C++) during indexing.

Removed extra path information from headers in containers converted to text using FileConverter.exe or
FileConvertJob with the dtsConvertInlineContainer flag

Removed "Document Properties" caption from Word, PowerPoint, and Excel 2003 file properties. For applications
that require this caption for backward compatibility, use the new flag dtsoFfIncludeDocumentPropertiesCaption in Options.FieldFlags

Added new values to SearchReportJob.Header and SearchReportTemplate.rtf: %%Ordinal%%, %%DocId%%, %%Type%%

Added new dtsConvertIncludeBOM flag to FileConverter.Flags to add UTF-8 BOM to UTF-8 output
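
For reference, the UTF-8 byte order mark is the three-byte sequence EF BB BF placed at the start of the output. The sketch below shows the effect of adding it, using Python's standard codecs module; the helper name encode_utf8 is hypothetical and the code does not call the dtSearch API.

```python
import codecs


def encode_utf8(text: str, include_bom: bool = False) -> bytes:
    """Encode text as UTF-8, optionally prefixed with the UTF-8 BOM."""
    payload = text.encode("utf-8")
    # codecs.BOM_UTF8 is the byte sequence b"\xef\xbb\xbf".
    return codecs.BOM_UTF8 + payload if include_bom else payload


with_bom = encode_utf8("caf\u00e9", include_bom=True)
print(with_bom[:3] == codecs.BOM_UTF8)
# The "utf-8-sig" codec strips the BOM again when decoding.
print(with_bom.decode("utf-8-sig"))
```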

Fixed too-long filenames generated for items extracted from PST files (names could be too long for some file systems when copied using Edit > Copy File in
dtSearch Desktop)

Other bug fixes.

dtSearch 7.68 (Build 8025) Released December 5, 2011

All products

Outlook and MIME email files have a new, simplified header that will be consistent between the two formats
with these fields: From, SentVia (for emails sent using a mailbox other than the "From" address), To, CC, BCC, Subject, Date.
The footer for these formats will include these additional fields for backward-compatibility:
Sender (combines From and SentVia); Recipient (combines To, CC, and BCC); SentDate (msg files only); DeliveredDate (msg files only).
The Sender and Recipients fields provide a way to search across all senders or all recipients of a
message. These fields will also allow searches using the field names used in older versions, such as "Sender contains example", to
continue to work.
Documents indexed with dtSearch versions 7.67 and earlier will be displayed with the old header so the change will
not cause hit highlighting problems with previously-indexed emails.

dtSearch Engine

Includes new (August 9, 2011) security updates from Microsoft Bulletin MS11-025.
For more information on this Microsoft security update, including updated dependencies for
dtSearch Engine developer components, please see this article.

Added dtsoFfIndexArchiveFileLists flag.
This option adds a searchable file named ArchiveFileList.html to
ZIP and RAR archives during indexing. The original file is not modified
but the ArchiveFileList.html file is searchable as if it were part
of the ZIP or RAR file. The file consists of a list of the names
of the files inside the archive.
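
To give a sense of what such a file list might contain, the sketch below builds a simple HTML list of the names inside a ZIP archive without modifying the archive. The exact layout of the ArchiveFileList.html generated by dtSearch is not documented here, so the markup and the function names are assumptions.

```python
import html
import io
import zipfile


def archive_file_list_html(archive) -> str:
    """Return an HTML page listing the names of the files in a ZIP archive."""
    # The archive is opened read-only, so the original file is not modified.
    with zipfile.ZipFile(archive) as zf:
        names = zf.namelist()
    items = "\n".join(f"<li>{html.escape(name)}</li>" for name in names)
    return f"<html><body><ul>\n{items}\n</ul></body></html>"


def make_sample_zip() -> io.BytesIO:
    """Create a small in-memory ZIP archive for demonstration."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        zf.writestr("report.txt", "hello")
        zf.writestr("data/a&b.csv", "1,2")
    buf.seek(0)
    return buf


print(archive_file_list_html(make_sample_zip()))
```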

dtSearch 7.67 (Build 7973) Released June 1, 2011

All products

Includes security updates from Microsoft Bulletin MS11-025.
For more information on this Microsoft security update, including updated dependencies for
dtSearch Engine developer components, please see this article.

Added support for indexing PST files directly. Because Outlook locks the PST file that is currently in use, this will not work
with the PST file that you are actively using in Outlook, and is primarily for use
in situations where archived or forensically-obtained PST files are being searched.

dtSearch 7.66 (Build 7936) Released January 25, 2011

dtSearch Engine

Added dtsSearchFastSearchFilterOnly search flag to enable much faster, optimized generation of a SearchFilter from
a search when no other output is required from the search.

Added WordListBuilder.GetLastError to the C++, Java, and .NET APIs to provide better reporting of errors
resulting from WordListBuilder calls.

Added new flag to enable caching of field values in WordListBuilder to make ListFieldValues calls faster.
The flag is dtsWordListEnableFieldValuesCache (in the WordListBuilderFlags enumeration) and is passed to WordListBuilder
using the new SetFlags method.

Added a way to disable file parsers using the file type table (filetype.xml) by setting the TypeId to the
id of the parser to disable and the Flags value to 2.

Added a mechanism for a dtsInputStream to simulate an I/O error by returning a negative value from read() of less than
10,000. When this occurs, dtSearch will interpret it as an I/O error and halt processing of the current input
file immediately, reporting an I/O error through the API.

dtSearch 7.65 (Build 7906) Released August 20, 2010

dtSearch Engine

Added dtsoFfSkipEmailHeaders flag for Options.FieldFlags to suppress searching and display of headers in MIME and Outlook messages

Fixes and minor enhancements

Reduced memory requirements for parsing very large XLS files

Fixed bug that allowed XML output from saved search results and XML generated by conversion to
it_ContentAsXml to contain the colon (":") character in tag names, which caused the generated XML
to fail validation.

dtSearch 7.64 (Build 7876) Released March 15, 2010

dtSearch Engine

Added dtsSearchLanguageAnalyzerSynonyms flag to enable using a language analyzer to generate morphological variations on a
search term at search time. When this flag is set, the language analyzer is called for each word or phrase in the search request.
The flag dtsLaInputIsSearchTerm is passed to the language analyzer in dtsLaJob.flags, so the language analyzer knows why it is being called.

Added dtssGetWordBreaker API function to provide direct access to the dtSearch Engine's internal word breaker using the
language analyzer API. For sample code demonstrating how to use this API, see the WordBreak example in examples\vc8\WordBreak.

Added more structural information to the output generated by conversion to the it_ContentAsXml file format.

Fixes and minor enhancements

Fixed incorrect hit highlighting when Unicode Filtering options at search time differ from the options
used to index a file. To ensure consistent options, Unicode Filtering options are stored in the index
when the index is created, in the index_a.ix file.

dtSearch 7.63 (Build 7836) Released October 29, 2009

Fixes

Compatibility notes for developers working with the .NET 2.0 API only

In dtSearch 7.63, the
DLL dependencies for dtSearchNetApi2.dll have changed due to the release of the Visual Studio .NET
2005 Service Pack 1 Security Update for ATL. Because dtSearchNetApi2.dll is built with
the updated version of Visual Studio .NET 2005, it requires the updated MFC and CRT DLLs that are included with
that version.
This issue does not affect any other dtSearch Engine API.

dtSearch 7.63 (Build 7835) Released October 12, 2009

dtSearch Engine

Added IndexFileInfo.UserFields in .NET API to provide access to stored fields through the IIndexStatusHandler
callback interface during indexing.

Added dtsnIndexDeletedFileRemoved, dtsnIndexListedFileRemoved, and dtsnIndexListedFileNotRemoved notifications
to the indexing status callbacks to notify the calling application when files are removed from the index during indexing
or when an attempt to remove a listed file fails.

Compatibility note for developers working with the .NET 2.0 API only: The
DLL dependencies for dtSearchNetApi2.dll have changed due to the release of the Visual Studio .NET
2005 Service Pack 1 Security Update for ATL. Because dtSearchNetApi2.dll is built with
the updated version of Visual Studio .NET 2005, it requires the updated MFC and CRT DLLs that are included with
that version. This Microsoft redistributable program will install the required components:
Microsoft Visual C++ 2005 Service Pack 1 Redistributable Package ATL Security Update (July 28, 2009)
http://www.microsoft.com/downloads/details.aspx?familyid=766A6AF7-EC73-40FF-B072-9112BAB119C2&displaylang=en

This issue does not affect any other dtSearch Engine API.

Fixes and minor enhancements

Fixed bug in search report generation causing text in adjacent table cells to be run together in output

dtSearchw.exe: Fixed bug affecting use of drag-and-drop to re-order columns in search results, causing the
wrong column order to result

Fixed memory leak (in version 7.62 only) when using regular expressions in File Segmentation Rules to split documents or in Text Fields definitions.

File parser bug fixes affecting: MS Word 2007, PDF, ZIP, OpenOffice.

dtSearchw.exe: Added option to suppress automatic correction of hit highlighting when a document was indexed with a different version
or the document was modified since it was last indexed (in Options > Preferences > Document Display).

dtSearch 7.62 (Build 7804) Released July 20, 2009

All products

dtSearch Engine

Java API: Added IndexInfo object for more efficient retrieval of index properties from an index

Java API: Added SearchFilter.SelectItems() with array of doc ids

.NET API: Added SearchFilter.SelectItems() with array of doc ids

Java API: Added SearchJob.WantResultsAsFilter

FieldFlags: Added dtsoFfHtmlSkipImgAlt and dtsoFfHtmlSkipInputValues

Language Analyzer API: Added dtsLaBlockWasSkipped to LanguageAnalyzerWordFlags, providing a way for a
language analyzer to request that the internal dtSearch word breaker handle a block of text from the
input.

C++ API: Added userFields to dtsIndexProgressInfo, providing a way to access stored fields from
a document as it is indexed

Added dtsConvertAutoUpdateSearch flag to ensure consistent hit highlighting when a document was
modified since it was indexed or was indexed by an older version of dtSearch than is used to search it.

Fixes and minor enhancements

dten600.dll: Fixed error causing corrupt index with message referencing !zd and int vector error when
automatic recognition of dates is enabled, hyphen processing is set to dtsoHyphenAll, and the last word of
a document ends with a hyphen.

Added new cmap files for PDF text extraction.

Improved speed for "not (something)" and pure xfilter searches.

Reduced memory use for searches that retrieve large numbers of documents with a relatively small MaxFilesToRetrieve value.

dtSearchw.exe: Fixed RTF output generated from Search Report that would not open in WordPad

dtSearchw.exe: Fixed hit navigation error caused when text with highlight markings was pasted into a Word 2007 document from
dtSearch search results, and then the resulting document indexed and searched again.

dten600.exe: Several improvements to the automatic detection of MIME-encoded files

dtSearch 7.61 Build 7769 Released April 2, 2009

All products

dtSearch Desktop/Network

Added improved zoom-in/zoom-out for document windows. To use, hold down the Ctrl key and roll the mouse wheel forward or
backward, or press Ctrl+PLUS to zoom in and Ctrl+MINUS to zoom out.

dtSearch Engine

Added it_ContentAsXml output format for FileConverter. This format
organizes document content, metadata, and attachments into a standard XML format for easier automated
processing. It does not currently support hit highlighting and is designed for
automated content extraction only.

Fixes and minor enhancements

dten600.dll: Added workaround for invalid records created in PowerPoint files when
the same file is edited by PowerPoint 2003 and PowerPoint 2007

dtSearch.exe: Fixed formatting error and added diagnostic information to error message when file could not be opened

dtSearch.exe: Added support in Edit > Copy File for copying container files other than archives

dtSearch 7.54 (Build 7680) Released September 5, 2008

All products

New Microsoft Access file parser added with no dependencies on the Microsoft JET Engine or ODBC drivers,
so it works identically under Windows and Linux. This file parser supports Access databases created
by Microsoft Access 95 through Access 2007.

New file parser added for Flash (*.swf) files.

dtSearch Engine

Added support for searching for Unicode characters that are not defined as letters by default (such as currency symbols).
By default, all Unicode characters that are defined as letters in the Unicode specification are searchable. To make other characters such as Unicode currency characters searchable, you can add a line to the end of the alphabet file listing the Unicode characters to make searchable. Example:

This example makes all of the Unicode currency characters such as the Euro, Pound, and Lira searchable characters.

Added ListIndexFlags.dtsListIndexIncludeDocInfoAsXml flag to list all documents in an index in XML
format including document properties and stored fields. The ListIndex.exe utility also has a new dx option
to do this for an index from the command-line.

dtSearch Publish

Added option to enable links to Office documents (doc, xls, etc.) to launch in Office, outside
of the lbview.exe program. The setting is "ExternalLaunchExtensions" in the lbview.ini file.

See below for important information about Adobe Reader and Adobe Acrobat 9 compatibility

Adobe Reader and Adobe Acrobat 9 Compatibility

Adobe Reader and Adobe Acrobat 9 have a new setting that disables hit highlighting
in PDF files by default. To change this setting in Adobe Reader or Adobe Acrobat 9,
click Edit > Preferences > Search, and check the box to "Enable search highlights
from external highlight server".

dtSearch Desktop 7.54 will automatically make this change when it detects that
hit highlighting is disabled (after asking permission). dtSearch Desktop 7.54
also fixes a problem that causes PDF files to open very slowly in dtSearch with Adobe Reader or
Adobe Acrobat 9 installed.

dtSearch Publish 7.54 has new settings in the lbview.ini file to automatically enable
PDF hit highlighting when it is disabled (after asking permission).

dtSearch Web cannot change this setting in client applications, so web sites that
rely on Adobe Reader to highlight hits in PDF files should notify users of the
need to change this setting to preserve hit highlighting.

Fixes and minor enhancements

dten600.dll: To improve consistency in the handling of punctuation in field names, unsearchable characters
are now removed from field names in input data, with a few exceptions (:&_+=.) to minimize the effect on backward
compatibility. In previous versions this was done in individual file parsers so the effect of punctuation in
field names depended on the format of the input data. This change will not generally affect searching because only searchable letters are used when
matching field names. This change may affect the field names associated with stored fields,
in cases where the field name contains punctuation characters.
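
The effect of this normalization can be approximated with the sketch below. It assumes that "searchable" means alphanumeric, which is a simplification (the real set depends on the alphabet settings), and the function name normalize_field_name is hypothetical.

```python
# Punctuation characters kept for backward compatibility, per the note above.
KEPT_PUNCTUATION = set(":&_+=.")


def normalize_field_name(name: str) -> str:
    """Drop characters that are neither alphanumeric nor in the kept set."""
    # Assumption: alphanumeric characters stand in for "searchable" characters.
    return "".join(ch for ch in name if ch.isalnum() or ch in KEPT_PUNCTUATION)


print(normalize_field_name("Doc-Title (v2)"))
print(normalize_field_name("meta:author_name"))
```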

dtSearchNetApi2.dll: If data source throws an exception, IndexJob will catch it and report the exception through
the Errors object

dtSearchNetApi2.dll: Added FileInfoFlags.fiOpenFailed to indicate when a document returned from a DataSource
with the DocIsFile flag set to true cannot be opened because it is either not present or locked

dten600.dll: Fixed merge bug affecting merges of indexes containing the same container file with the newer version of the
container file in the target index

dtSearch 7.51 (Build 7556) Released January 18, 2008

dtSearch Desktop

Search results saved as XML include the selection state of the items in the search results list (i.e., checked or unchecked). Search results
saved in other formats such as CSV can either include all items or just selected items.

dtSearch Web

Added 64-bit version of dtSearch Web and dtSearch Web Setup

Fixes and minor enhancements

dtSearch.Spider2.dll: Added 64-bit version of the .NET Spider API

dtSearchw.exe: Added status bar indicator for total hits and total number of files retrieved

lbvprot.dll: Fixed error in standard CD type causing it to get stuck at an "Opening page..." message

dtSearch 7.50b (Build 7518) Released November 25, 2007

Fixes and minor enhancements

dten600.dll: Fixed SearchReportJob error causing more blocks of context than specified by MaxContextBlocks to be included
in the generated report.

lbvprot.dll: Fixed error starting CGI applications

dtSearch 7.50 (Build 7517) Released November 9, 2007

Enhancements (All products)

Improved integration with external language analyzers: (1) Language analyzers will be given much larger chunks
of text to analyze, which enables some language analyzers to operate more effectively.
(2) Language analyzers will be given consistently-sized
chunks of text whether indexing or highlighting hits, which ensures that hit highlighting will not be affected by
changes in the behavior of a language analyzer depending on the size of the data it receives.

dtSearch Desktop

In the Edit > Copy File dialog box, added option to preserve original modification,
creation, and last access times of the original files

In the Edit > Copy File dialog box, added option to copy the entire container file
when a matching document is inside a container (such as a ZIP file or email archive)

Option in Options > Preferences > Indexing Options > Letters and Words to automatically
insert a word break around Chinese, Japanese, and Korean characters in text. This makes it possible
for documents that do not contain word breaks to be searched.

dtSearch Engine

64-bit version of the dtSearch Engine with C++ and .NET APIs.

New dtsoTfAutoBreakCJK flag in Options.TextFlags to automatically
insert a word break around Chinese, Japanese, and Korean characters in text. This makes it possible
for documents that do not contain word breaks to be searched.
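
The idea of breaking around each Chinese, Japanese, or Korean character can be sketched as follows. The Unicode ranges used here (CJK Unified Ideographs, Hiragana, Katakana, Hangul Syllables) are an assumption chosen for illustration and may not match the ranges dtSearch uses; auto_break_cjk is a hypothetical name.

```python
import re

# Assumed ranges: CJK Unified Ideographs, Hiragana, Katakana, Hangul Syllables.
CJK_CHAR = re.compile(r"([\u4e00-\u9fff\u3040-\u309f\u30a0-\u30ff\uac00-\ud7a3])")


def auto_break_cjk(text: str) -> str:
    """Insert a word break (space) around every CJK character."""
    spaced = CJK_CHAR.sub(r" \1 ", text)
    # Collapse the doubled spaces produced between adjacent CJK characters.
    # Note that this also normalizes any other whitespace runs in the text.
    return " ".join(spaced.split())


print(auto_break_cjk("\u691c\u7d22abc"))
```

With each character treated as its own word, a search for a single character or a sequence of characters can match text that was written without spaces.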

ListIndexJob added to the Java API

New dtsListIndexIncludeDocCount flag added to ListIndexFlags, to provide the document count
for each word listed

Fixes and minor enhancements

dten600.dll: Added dtsLaJob.pFileInfo to provide language analyzer with a dtsFileInfo describing the document
being processed

dten600.dll: Added dtsLaJobInputIsFirstBlockInDocument value for dtsLaJob.flags to tell language analyzer when
a new document is starting

dten600.dll: Fixed error in title attribute for documents indexed using the COM implementation of
the data source API

dten600.dll: Changed the behavior when a container document such as a ZIP file is removed from an index using its doc id
(using IndexJob.ActionRemoveListed). Instead of just removing the container, the container and all contained items will be removed.

dtSearch 7.41 (Build 7420) Released April 21, 2007

Enhancements (dtSearch Engine)

Added support for automatically varying hit weights according to the field they occur in, through the new
SearchJob.FieldWeights setting. For more information, see the "Relevance" topic in the dtSearch Engine API Reference.

Compatibility note for developers working with the .NET 2.0 API only: The
DLL dependencies for dtSearchNetApi2.dll have changed due to Visual Studio .NET
2005 Service Pack 1. Because dtSearchNetApi2.dll is built with
Service Pack 1, it requires the updated MFC and CRT DLLs that are included with
that version. Executing the vcredist_x86.exe included with Visual Studio
.NET 2005 Service Pack 1 (dated December 2, 2006 or later) will install these
components.
This issue does not affect any other dtSearch Engine API.

Fixes and minor enhancements

dten600.dll: New fields added as properties of .eml files - CC, BCC, and Attachments (a list of the filenames of all
attachments).

dtSearch .exe and .msi files digitally signed for better operation in Windows Vista

Enhancements (dtSearch Engine)

Added IndexCache object in the .NET 2.0 API, and dtsIndexCache object in the C++ API, to enable much faster searching
when a series of searches must be done against a small number of indexes. The IndexCache maintains a thread-safe pool of open
indexes that are available for searching during the lifetime of the cache. Using the cache eliminates the need to open and close
the index for each search
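
The benefit of such a cache can be illustrated with a generic thread-safe pool of open handles. This is only a conceptual sketch: the class OpenIndexPool, its borrow method, and the opener callback are hypothetical and are not part of the dtSearch API.

```python
import threading
from contextlib import contextmanager


class OpenIndexPool:
    """Keeps opened index handles for reuse instead of reopening per search."""

    def __init__(self, opener):
        self._opener = opener      # callable that opens an index by path
        self._idle = {}            # path -> list of idle handles
        self._lock = threading.Lock()
        self.opens = 0             # number of times an index was opened

    @contextmanager
    def borrow(self, path):
        # Take an idle handle if one exists, otherwise open a new one.
        with self._lock:
            idle = self._idle.setdefault(path, [])
            handle = idle.pop() if idle else None
        if handle is None:
            handle = self._opener(path)
            with self._lock:
                self.opens += 1
        try:
            yield handle
        finally:
            # Return the handle to the pool for the next search.
            with self._lock:
                self._idle[path].append(handle)


pool = OpenIndexPool(lambda path: object())
for _ in range(3):
    with pool.borrow("idx"):
        pass  # a search would run here
print(pool.opens)
```

In this sketch, three consecutive searches against the same index open it only once, because the handle is returned to the pool after each use.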

Enhancements (dtSearch Desktop)

Added option in Options > Preferences > Spider Options to log the links found in each page the Spider follows.

Added option in Options > Preferences > Search Options to change the maximum number of words a search request
can match.

Fixes and minor enhancements

dten600.dll: When serializing stored fields to XML, add a _ in front of any stored field names that begin with a digit
so the resulting XML remains syntactically correct.
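
XML element names may not begin with a digit, which is why the prefix is needed. A minimal sketch of the same idea, using Python's standard xml.etree.ElementTree, follows; the function names and the "Fields" root element are hypothetical and do not reflect the XML layout dtSearch produces.

```python
import xml.etree.ElementTree as ET


def xml_safe_field_name(name: str) -> str:
    """Prefix names that start with a digit so they are valid XML names."""
    return "_" + name if name[:1].isdigit() else name


def stored_fields_to_xml(fields: dict) -> str:
    """Serialize stored fields as child elements of an assumed root element."""
    root = ET.Element("Fields")
    for name, value in fields.items():
        ET.SubElement(root, xml_safe_field_name(name)).text = value
    return ET.tostring(root, encoding="unicode")


xml_text = stored_fields_to_xml({"2ndAuthor": "Smith", "Title": "Report"})
# Parsing succeeds because every element name is now well-formed.
print(ET.fromstring(xml_text).find("_2ndAuthor").text)
```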

dten600.dll: In the C++ API, the pOnIndexWordFn callback was called with encoded field information in addition to the text of the word,
and if the called function did not preserve this field information intact, field attributes could become invalid. To prevent this, in
version 7.40 the field information is removed before the callback so pOnIndexWordFn will not see or be able to affect field attributes.

dtSearch 7.30 (Build 7320) Released September 30, 2006

Enhancements (All products)

Added preliminary support for Word 2007 (*.docx) and Excel 2007 (*.xlsx) based on the current
Office 2007 beta and available documentation.

Added support for JPG and TIFF metadata, including EXIF and IPTC fields.

Unicode filtering file parser can handle individual documents larger than 2 GB, and support for files
larger than 2 GB added to the extext.exe utility

Improved handling of partially inaccessible email files. In previous versions, if an email
had encrypted or corrupt data (for example, an encrypted attachment), the whole email was
reported as encrypted or corrupt. In this version, the readable portion of the message
is indexed and the unreadable portion is separately reported as a partially encrypted or
partially unreadable file. This change applies to Outlook messages, TNEF files, .eml files,
MBOX archives, and .msg files.

Enhancements (dtSearch Engine)

Beta x64 (64-bit) versions of the dtSearch Indexer and dtSearch Engine (dtIndexer64.exe, dtengine64.dll, and
dtSearchNetApi2.dll). The index format and APIs (C++, COM, and .NET) are identical to the 32-bit version.
The 64-bit components are in a separate download file (dtSearch64_730.exe) with the same installation password
as the dtSearch Engine SDK.

dten600.dll: PDF file parser detects and handles case where text in right-to-left languages (Hebrew or Arabic) is
stored backwards (left-to-right) in a PDF file, and automatically inverts the characters in the word so it will be correctly searchable

dtSearch 7.22 (Build 7217) Released March 14, 2006

Enhancements (dtSearch Engine)

.NET 2.0 API for Visual Studio .NET 2005. The .NET 2.0 API wrapper is dtSearchNetApi2.dll, and the .NET 2.0 version of the Spider API is dtSearch.Spider2.dll.
The API is identical to the .NET 1.1 API. For sample code, see the examples\cs2 and examples\vb.net2 folders.

dten600.dll: Excel file parser defaults to 10 digits of precision for numbers without a
specified format (consistent with Excel).
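
As a rough illustration of what 10 digits of precision means for an unformatted number, the sketch below formats values to 10 significant digits with Python's general number format. It is an approximation for explanation only and is not the parser's actual formatting code; format_general is a hypothetical name.

```python
def format_general(value: float, digits: int = 10) -> str:
    """Format a number with at most the given count of significant digits."""
    # The "g" presentation type drops insignificant trailing zeros.
    return format(value, f".{digits}g")


print(format_general(1 / 3))
print(format_general(2.5))
```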

dten600.dll: Minor improvements to Unicode filtering algorithm.

dtSearch 7.21 (Build 7164) Released January 23, 2006

Enhancements (All products)

IFilter support to enable
dtSearch to parse document types such as Microsoft OneNote and AutoCAD that
include IFilters. IFilters are components that enable various Microsoft search products, such as Microsoft
Index Server, to extract text from documents. For example, when you install Microsoft
OneNote, an IFilter is installed to enable searching of *.one files. To tell dtSearch
to use installed IFilters to process some of your files, set up a rule in Options > Preferences > File Types and
under File type, select "IFilter". In dtSearch Engine applications, use the FileTypeTableFile to specify the filename
patterns to use with IFilters. The IFilter adapter only works on systems with the Microsoft component query.dll installed.
For information on products that include query.dll, see
http://support.microsoft.com/dllhelp
For more information on IFilters, see
http://www.ifilter.org/ or
http://channel9.msdn.com/wiki/default.aspx/Channel9.DesktopSearchIFilters

Fixes

dten600.dll: Fixed bug that prevented some items in ZIP files from being displayed after a search (an "unable to access
input file" message would appear instead).

dtsearch.h: unnamed unions removed from the dtsMessage structure. This will not affect binary compatibility
but may require source code changes in C++ code that accessed undocumented union members. Because the removed
union members were undocumented, this change should affect very few programs.

Enhancements (dtSearch Desktop)

dtindexer.exe: added /caf and /cat command-line options to cache text (/cat) or cache original files (/caf), when creating indexes using
the command line, and /recog to recognize an index.

Added Help > Check For Updates feature to automatically download new
versions

Enhancements (dtSearch Engine)

dtSearch.Spider.dll component provides a .NET API for the dtSearch Spider. For API documentation, see dtSearchNetApi.chm.
For sample code, see C:\Program Files\dtSearch Developer\examples\cs\SpiderDemo.

New xfilter search type, "ext", to search only on the filename extension (dot required). Examples:
xfilter(ext ".doc") matches files with a .doc extension; xfilter(ext "~.doc") matches files without a .doc extension;
xfilter(ext ".") matches files with no extension. This search feature will only work with documents that were indexed with
dtSearch 7.2 or later.
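
The three matching rules above can be modeled in a few lines of Python. This is a minimal sketch of the behavior as described in this note, not dtSearch code: the function name ext_filter_matches is hypothetical, and case-insensitive comparison is an assumption.

```python
import ntpath  # Windows-style path handling, available on every platform


def ext_filter_matches(filename: str, pattern: str) -> bool:
    """Model of the xfilter(ext "...") rules described in the release note.

    Hypothetical helper for illustration; not part of the dtSearch API.
    """
    # A leading "~" inverts the match.
    negate = pattern.startswith("~")
    if negate:
        pattern = pattern[1:]

    # splitext returns "" as the extension when the name has none.
    ext = ntpath.splitext(filename)[1].lower()

    if pattern == ".":
        matched = ext == ""  # "." means "no extension"
    else:
        matched = ext == pattern.lower()  # assumption: case-insensitive

    return not matched if negate else matched
```

For example, ext_filter_matches("report.pdf", "~.doc") is true because the file does not have a .doc extension.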

SearchReport supports %%FirstHit%% macro in ContextHeader to indicate the word offset of the first hit in the context block

dtsIndexCacheTextWithoutFields flag added to IndexingFlags. This flag makes it possible to cache text (for generation of
a synopsis to include in search results) without including any of the fields added using the data source API.

dtsErAccCachedDoc flag added to ErrorCodes. This error code indicates that a document could not be extracted from
the document cache in an index (this usually means that the index was created without caching enabled)

dtsConvertJustDetectType flag added to ConvertFlags, to have FileConverter or DFileConvertJob just detect the file format of
a document. The format is returned in FileConverter.DetectedTypeId.

dtsReportIncludeFileStart flag added to ReportFlags. This flag causes a block of text from the beginning of the
document to be included in the generated search report.

A new search feature makes it possible to restrict a search to the text of documents (excluding any metadata). To search for text that is not
in any field, search for //text contains (search request). Example: (//text contains apple) and (author contains smith)

Fixes

DynaZip unzip component (dunzip32.dll) updated to new version that eliminates buffer overrun vulnerability in earlier versions.

dtSearch 7.10 (Build 7045) Released August 8, 2005

Enhancements (dtSearch Engine)

Added two new ASP.NET samples, one in VB.NET and one in C#, that demonstrate a search interface
using a grid control for search results. The new samples are installed to C:\Program Files\dtSearch Developer\examples\asp.net.
Please see the readme file in the project folders before trying to open them in Visual Studio -- a virtual directory mapping
for C:\Program Files\dtSearch Developer\examples\asp.net has to be created first or Visual Studio will not be able to open
the project.

GetNthWordDocCount added to WordListBuilder to get the number of documents a word occurs in

SearchReportJob enhancements: Added ContextSeparator; itUnformattedHTML output format, for easier generation of
a synopsis; faster generation of search reports when search results cover multiple indexes; dtsReportLimitContiguousContext flag
to prevent a very large synopsis when there are many hits close together.

In the OnFound callback notification in the C++ and .NET interfaces, an application can veto individual items
to prevent them from being included in search results. See SearchResultsItem.VetoThisItem (.NET) and
DSearchJob::VetoThisItem (C++ Support Classes).
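
The veto pattern can be illustrated with a small Python model. This is a sketch of the general callback technique only, and it does not call the dtSearch API: FoundItem, run_search, and veto are hypothetical names standing in for the engine, the OnFound notification, and VetoThisItem.

```python
class FoundItem:
    """Hypothetical stand-in for an item passed to an OnFound-style callback."""

    def __init__(self, filename):
        self.filename = filename
        self.vetoed = False

    def veto(self):
        # Mark this item so the search loop leaves it out of the results.
        self.vetoed = True


def run_search(candidates, on_found):
    """Toy search loop: call on_found for each match, keep items not vetoed."""
    results = []
    for filename in candidates:
        item = FoundItem(filename)
        on_found(item)
        if not item.vetoed:
            results.append(filename)
    return results


def skip_temp_files(item):
    """Example callback: exclude temporary files from the results."""
    if item.filename.lower().endswith(".tmp"):
        item.veto()
```

Calling run_search(["a.doc", "b.tmp", "c.pdf"], skip_temp_files) returns only a.doc and c.pdf, since the callback vetoes b.tmp before it reaches the results.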

dtSearchNetApi.dll uses registry type library information and delay loading
to eliminate the need for dten600.dll to reside on the system PATH in ASP.NET
applications.

New TextFlags option to suppress automatic generation of
xfirstword and xlastword (dtsoTfSkipXFirstAndLast)

Options.MaxFieldNesting setting to limit the permissible depth of field nesting

dtSearch 7.00 (Build 7008) Released May 18, 2005

Enhancements (All products)

High-capacity index format released, with support for over 1 terabyte of data per index.
dtSearch 7 can update and search indexes created with dtSearch 6.
To upgrade an index to the version 7 format using dtSearch Desktop,
(1) click Index > Update Index...,
(2) check the box to "Upgrade index to version 7 format", and
(3) click "Start Indexing".

Enhancements (dtSearch Desktop)

Faster indexing of Outlook message stores. Indexing speed is substantially faster, especially for incremental updates.
For compatibility, existing Outlook indexes will continue to use the previous indexer, so the improvements
will only apply to new indexes.

Edit > Copy File can copy retrieved Outlook messages as .msg files (formerly they were converted to HTML).
This only works for Outlook messages indexed using the new Outlook indexer (see above).

mapitool.exe command-line utility to convert PST files or other Outlook-accessible message stores
to MSG files. See mapitool.html for documentation.

Enhancements (dtSearch Engine)

Support for hidden stored fields. Hidden stored fields are returned in search results like stored fields, but
are not displayed as part of the document and are not searchable. Only fields returned through the data source API
in DocFields (.NET/COM) or dtsInputStream.fields (C++) can be hidden stored fields. To designate a field as hidden,
insert ** in front of the field name.

Note: dtv_ms.dll, an external file parser included in prior versions, is now compiled into dten600.dll.

dtSearch 6.4 (Build 6482) Released September 4, 2004

Enhancements (All products)

Improved indexing speed

Support for indexing .tar, .gz, and .tgz archives

Support for indexing metadata (Author, Title, etc.) in MP3, ASF, and WMV files

Improved file parser for RTF files.

XML parser improvements in handling incorrect XML input, such as mismatched tags and > and < characters
in field text

Enhancements (dtSearch Web)

Web search form remembers users' search criteria from prior searches

Many improvements to search form generation: more controls including date range control,
simpler search forms, automatic generation of field searching controls;
stylesheet-based search form and search results formatting; Form Builder remembers search form option
settings from previously-built form

dtSearch Web Setup has new options to upgrade all dtSearch Web installations at once, and to
remove a dtSearch Web installation

dtSearch 6.33 (Build 6430) Released April 21, 2004

Fixes

dtSearch 6.32 (Build 6429) Released March 5, 2004

Enhancements (All products)

Improved relevancy ranking using positional scoring, which ranks documents higher
when hits occur near the top of the file or are clustered within a document. In dtSearch Desktop
and dtSearch Web, positional scoring is applied automatically when automatic term weighting
is selected. In dtSearch Engine applications, use the new dtsSearchPositionalScoring search flag
to enable positional scoring.

Enhancements (dtSearch Spider)

Added option to allow Spider to crawl across multiple servers from a single starting URL

Added option to limit maximum size of items Spider can download from a site

Added option to limit number of files Spider can index on a web site

Added option to limit number of minutes Spider can spend indexing a single web site

Enhancements (dtSearch Engine)

Added TextFlags option dtsoTfSkipNumericValues. By default, dtSearch indexes numbers both as
text and as numeric values, which is necessary for numeric range searching. Use this flag
to suppress indexing of numeric values in applications that do not require
numeric range searching. This setting can reduce the size of the index by about 20%.

Added dtsoFfSkipFilenameFieldPath FieldFlag, to allow indexing of the filename as a field
without the whole path

dtSearch 6.30 (Build 6386) Released November 11, 2003

Enhancements (All products)

Added pre/N connector, which is like W/N but requires that the first expression occur before the second
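
The difference between the two connectors can be sketched over word positions. This is an illustrative Python model, not the dtSearch implementation: the helper names are hypothetical, and treating "within N words" as a position difference of at most N is an assumption.

```python
def word_positions(text, word):
    """Return the zero-based positions at which word occurs in text."""
    words = text.lower().split()
    return [i for i, w in enumerate(words) if w == word.lower()]


def within(positions_a, positions_b, n):
    """W/N model: some occurrence of A is within n words of some occurrence of B."""
    return any(abs(a - b) <= n for a in positions_a for b in positions_b)


def pre(positions_a, positions_b, n):
    """pre/N model: like W/N, but the occurrence of A must come before B."""
    return any(0 < b - a <= n for a in positions_a for b in positions_b)
```

With the text "apple pie with pear", the model matches apple pre/3 pear but not pear pre/3 apple, while W/3 matches in both directions.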

Enhancements (dtSearch Desktop)

Added option in Options > Preferences > Filtering Options to control the minimum size of a text segment

Added option in Options > Preferences > Search Results to remember the sort order from the previous search.
For example, if you click the Filename column to sort search results by filename, after the next search results
will be displayed sorted by filename. The remembered sort order overrides whatever was selected in the
Search dialog box.

Enhancements (dtSearch Web / dtSearch Publish)

Enhancements (dtSearch Engine)

Language Analyzer API for integration of third-party language tools (such as a Japanese or Chinese dictionary-based
word breaker, or an Arabic morphological analyzer). For more information, see the "Language Analyzer API" topic
in the dtSearch Engine help file, dtengine.chm

Fixes

dten600.dll: Text Fields limited to first N lines were applied through entire file for WordPerfect files

dtSearch: Fixed bug affecting internal links in HTML files opened after a
search

dtIndexer: Bug in index update scheduler prevented a scheduled task from being created if
another task had previously been created, and deleted, with the same name

dten600.dll: Search report generated using exact words of context contained an incorrect character at end of a context block
if the last word in the context block was followed by a Unicode punctuation mark.

dtSearch: After installing Adobe Reader or Acrobat 6, PDF files opened very slowly in dtSearch Desktop.

dten600.dll: In MIME-encoded emails with multipart/alternative encoding (message included in both HTML and text),
message text appeared twice.

dtsConvertXmlToXml flag enables FileConverter to highlight hits in XML documents with XML output. For more information, see
"Highlighting hits in XML" in the dtSearch Engine help file and tech support article dts0183.

Fixes

dtSearch: TAB key to switch between search results and document window did not work

dtIndexer: The "Update Multiple Indexes" dialog box could add web site content from an index to subsequently-updated indexes in the same session

dtSearch 6.20 (Build 6320) Released May 6, 2003

Enhancements (dtSearch Desktop)

Support for indexing Outlook 2003 messages. (Note: This is based on
the Office 2003 Beta 2. It is possible that Microsoft may make additional
changes to Office 2003 when it is released that will require more changes
to the dtSearch Outlook indexer.)

New search results list

The search results list can contain a brief synopsis for each item showing the first couple of hits
and a few words of context around each hit. To enable this feature, click Options > Preferences > Search results,
and check the box labelled "First hits in context".

Click the <-> mark in the upper left corner of search results to automatically
size columns to fit their contents. Click it a second time to automatically size columns to fit
in the search results window. (Click Options > Preferences > Search results to have
search results automatically resized in either way.)

Drag and drop column headers to change the order in which search results items appear

Items in search results can be dragged to Explorer (to move the files) or to
email programs (to send the files as an attachment).

(To make dtSearch use the search results list from prior versions, run dtsearch.exe or dtsearchw.exe with the /lv command-line switch.)

New text filtering option for indexing recovered forensic data. See the Options > Preferences > Filtering options dialog box
and the "Filtering options" help topic in the dtSearch help file for more information

New option in the Options > Preferences > File Types dialog box to
require that a set of files be indexed as HTML or plain text, even if they
appear to have a different format.

Enhanced filename filters for use inside ZIP archives. See the "Filename Filters" topic in the dtSearch help file for more information.

Press Ctrl+Shift+UP to enlarge the text font, or Ctrl+Shift+DOWN to reduce the text font.

Edit > Copy File handles items in container files (such as ZIP archives or databases) better,
copying only the items retrieved from the search rather than the entire container. For example, if a search
retrieves sample.doc inside c:\archive.zip, then "Copy File" will extract and copy sample.doc rather than copying
the whole archive.zip file.
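
The idea of copying a single item out of a container can be sketched with Python's standard zipfile module. This only illustrates the behavior described above and is not how dtSearch Desktop is implemented; the function name copy_item_from_archive is hypothetical.

```python
import zipfile


def copy_item_from_archive(archive_path, member_name, dest_dir):
    """Extract one member of a ZIP archive into dest_dir.

    Returns the path of the extracted file. The rest of the archive
    is left untouched and is not copied.
    """
    with zipfile.ZipFile(archive_path) as archive:
        return archive.extract(member_name, path=dest_dir)
```

For the example in the note, copy_item_from_archive(r"c:\archive.zip", "sample.doc", dest_dir) would write only sample.doc to the destination folder.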

Enhancements (dtSearch Web / dtSearch Publish)

New ResultsTableItem macros: %%PhraseCount%% (number of hits in a document,
counting each phrase as a single hit) and %%HitsByWord%% (list of words or phrases
matched in a document, with the number of hits on each).

New settings for dtsearch_options.html: HttpProxy and SERVER_NAME. See dtSearch_Web.chm for more information.

Enhancements (dtSearch Engine)

Added ZIP file parser to Linux version

Added Java JNI API to Linux version

ExText text extracting algorithm, for indexing recovered forensic data,
integrated into the dtSearch Engine as the "Filtered Binary" file parser.
See "Filtering options" in the dtSearch Desktop help file for information on how
this filtering algorithm works. A new value for the Options.binaryFiles
flag, dtsoFilterBinaryUnicode (4), enables this parser.

FileConverter and dtsFileConvertJob2 have new typeId property that can be used to
specify the file parser to be used with the input.

Serialized search results include search flags and fuzziness

Added dtsoFfXmlHideFieldNames field flag to suppress indexing of field names
in XML files.

Data source API (C++): Added pFileInfo member to dtsDataSource to provide
information on the last file indexed. See "dtsDataSource" in dtengine.chm
for more information.

Data source API (C++): Added typeId to dtsInputStream to provide a way to specify the
file parser that should be used for an input file. See "dtsInputStream" in dtengine.chm
for more information.

Data source API (VB/ASP): Added DocId, DocWordCount, and DocTypeId
properties of DataSourceToIndex to provide information on the last file indexed.
See "Indexing ActiveX Data Sources" in dtengine.chm for more information.

Data source API (Java): Added getDocId, getDocWordCount, and getDocTypeId
methods in new DataSource2 class to provide information on the last file indexed.
See the DataSource2 topic in the JavaDoc documentation for more information, and
see the Java dsource sample application for sample code.

Data source API (Java): Added getDocBytes() to provide a way to return
documents in a memory buffer

Java FileConverter and SearchReportJob objects: Added setDocBytes() for a calling program to
provide an input document in a memory buffer

phraseCount, reporting the number of hits matched in a request with each phrase
counted as a single hit, is computed if dtsSearchWantHitsByWord search flag is set.
This value can be accessed in the C++ API as dtsSearchResultsItem.phraseCount, and
in the COM and Java interfaces as DocDetailItem("_phraseCount").

Added dtsSearchJob.maxFilesToRetrieve2 to provide a 32-bit version of this limit,
and made the Java and COM maxFilesToRetrieve properties 32-bit.

Spider uses WinHttp 5.1 library under Windows XP and Windows 2000 SP3. This library provides improved handling of
authentication and generally better performance (in our testing). WinHttp 5.1 is included with
Windows XP and Windows 2000 SP3 and is not available for other platforms.

"Create Group Policy" dialog box added for automatic deployment of dtSearch and shared indexes across a network. See
the "Automatic deployment of dtSearch on a network" topic in the dtSearch Desktop help file for more information.

Outlook indexer can index appointments and journal items, in addition to messages, tasks, contacts, and notes

Enhancements (dtSearch Engine)

New HttpSearch sample application demonstrates client-server searching with dtSearch Web on the server and
the CHttpSearchJob C++ class used on the client to perform a search.

SearchFilter object -- added SelectItemsBySearch method to allow a search
filter to be set up as the results of one or more searches.

pOnIndexWordFn added to the dtsIndexJob in the C++ API, to allow an
indexing program to modify text as it is being indexed (to customize character
handling or add alternative forms of a word to the index). See the
dtsOnIndexWordInfo topic in dtengine.chm for more information.

Fixes

dten600.dll: Default name for alphabet file changed from "ENGLISH.ABC" to "DEFAULT.ABC", since it no longer
has language-specific data

dtindexer.exe: Merge indexes did not combine the list of folder selections for the merged indexes

dtSearch.exe: "fields" button in Search dialog box did not position the cursor correctly after fields
were inserted into a search request

dtindexer.exe: Spider reports when a web page cannot be accessed due to an authentication error
or server error.

dten600.dll: If an index path contains an accented letter, the noise word list was not used and all words
were indexed.

dtSearch 6.07 (Build 6205) Released June 17, 2002

Enhancements (All products)

Faster indexing for large indexes (typically about 20% faster)

Added support for UTF-16 encoded HTML and XML (little-endian and big-endian)
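
Little-endian and big-endian input of this kind is commonly told apart by the byte order mark at the start of the file. The sketch below shows that general technique using Python's standard codecs constants. It is not taken from dtSearch, which may detect encodings differently, and the helper names and the UTF-8 fallback are assumptions.

```python
import codecs


def detect_utf16_byte_order(data: bytes):
    """Return the codec name implied by a UTF-16 byte order mark, or None."""
    if data.startswith(codecs.BOM_UTF16_LE):
        return "utf-16-le"
    if data.startswith(codecs.BOM_UTF16_BE):
        return "utf-16-be"
    return None


def decode_markup(data: bytes, fallback: str = "utf-8") -> str:
    """Decode HTML or XML bytes, honoring a UTF-16 byte order mark if present."""
    encoding = detect_utf16_byte_order(data)
    if encoding is None:
        # No UTF-16 mark found: fall back to the assumed default encoding.
        return data.decode(fallback)
    # Skip the two-byte mark before decoding the rest.
    return data[2:].decode(encoding)
```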

Blocks of HTML can be excluded from indexing using <!--BeginNoIndex--> and <!--EndNoIndex--> tags. (The comment
tags must appear exactly as they appear here, with no spaces or other variations.)
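
The effect of the tags can be modeled as removing everything between an exact BeginNoIndex/EndNoIndex pair before the text is indexed. The following Python sketch illustrates that idea under the exact-match rule stated above. It is not the dtSearch parser, and the function name is hypothetical.

```python
import re

# Exact tag spelling only: no spaces or other variations are accepted.
NO_INDEX_BLOCK = re.compile(
    r"<!--BeginNoIndex-->.*?<!--EndNoIndex-->",
    re.DOTALL,  # allow excluded blocks to span multiple lines
)


def strip_no_index_blocks(html: str) -> str:
    """Return html with every BeginNoIndex/EndNoIndex block removed."""
    return NO_INDEX_BLOCK.sub("", html)
```

A comment written as <!-- BeginNoIndex --> (with spaces) is not recognized by this model, mirroring the restriction in the note.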

Improved formatting in display of Excel spreadsheets

Enhancements (dtSearch Desktop)

Option to automatically use "Report" view for very long text files

Search reports can include documents indexed with the spider (http: references) and Outlook messages

Edit|Copy file list to copy a list of filenames from search results to the clipboard

Indexing Options: new option to "Index HTML scripts, styles, links, and comments" (causes these items to become visible and searchable in dtSearch)

Improved status reporting in the Forensic Indexer (forix.exe).

"View File" can be used to open a saved search results list

Enhancements (dtSearch Engine)

dtsfclib.lib: updated to work with Visual Studio .NET. See "C++ Support Classes" in dtengine.chm

C++ Support Classes can optionally be placed in a "dtSearch" namespace by declaring the USE_DTSEARCH_NAMESPACE macro

SearchFilter object - added Read and Write functions to save to/read from disk files, ReadMultiple to
read and combine multiple filters, and AND and OR operations to logically combine two filters
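
Conceptually, a search filter is a set of documents that are eligible to be returned, so AND and OR correspond to set intersection and union. The Python sketch below models that idea. It is not the SearchFilter object itself: the DocFilter class and its method names are hypothetical, and representing a filter as a set of document ids is an assumption.

```python
class DocFilter:
    """Toy model of a search filter: the set of eligible document ids."""

    def __init__(self, doc_ids=()):
        self.doc_ids = set(doc_ids)

    def and_(self, other):
        # Documents must be allowed by both filters.
        return DocFilter(self.doc_ids & other.doc_ids)

    def or_(self, other):
        # Documents may be allowed by either filter.
        return DocFilter(self.doc_ids | other.doc_ids)

    def apply(self, hits):
        # Keep only the hits that the filter allows, preserving their order.
        return [doc_id for doc_id in hits if doc_id in self.doc_ids]
```

For instance, a filter built from a database query can be ANDed with a filter built from an earlier text search, and the combined filter then restricts the next search to documents present in both.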

Added percentFull property in dtsIndexInfo

An alternative search syntax, the "All Words"/"Any Words" syntax, can be used in searches using the dtsSearchTypeAllWords
and dtsSearchTypeAnyWords search flags. This new search syntax supports use of quotation marks to indicate phrases and
+ and - to indicate required and excluded words. See "dtsSearchTypeAllWords and dtsSearchTypeAnyWords" topic in dtengine.chm.
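
A request in this style can be broken into required terms, excluded terms, and plain terms with a small tokenizer. The Python sketch below is a simplified illustration of the syntax as summarized here. The authoritative grammar is in dtengine.chm, and the function name and the returned dictionary layout are hypothetical.

```python
import re

# Optional + or - sign, followed by a quoted phrase or a bare word.
TOKEN = re.compile(r'([+-]?)(?:"([^"]*)"|(\S+))')


def parse_request(request: str):
    """Split a request into required, excluded, and plain terms."""
    required, excluded, plain = [], [], []
    for sign, phrase, word in TOKEN.findall(request):
        term = phrase or word
        if not term:
            continue  # ignore an empty quoted phrase
        if sign == "+":
            required.append(term)
        elif sign == "-":
            excluded.append(term)
        else:
            plain.append(term)
    return {"required": required, "excluded": excluded, "plain": plain}
```

Under this model, the request +apple -pear "fruit salad" banana yields apple as required, pear as excluded, and the phrase fruit salad and the word banana as plain terms.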

andany search connector added to allow optional words to be added to a search request.
(see "Search Requests" help topic in dtengine.chm)

Enhancements (dtSearch Desktop)

Default character encoding option setting in the Options > Preferences > File Types dialog box provides a way to override
automatic encoding detection for file types that do not contain encoding information

Search filters provide a way to pre-select the documents that are eligible to be returned in a search. This feature
can be useful when a database search and a dtSearch text search must be combined. See the SearchFilter (VB) or dtsSearchFilter (C++)
topic in the dtSearch Engine help file for details.

dtSearch Web (dtisapi6.dll) logging of search requests and document access. For more information, see the "Generated Files" topic in dtSearch_Web.chm.

Edit|Select all and Edit|Clear selections commands for easier selection of items in search results

"Search for List of Words" search type in the Search menu, for searching using the contents of a text file as the search request.
Results of the search can be displayed in dtSearch Desktop or can be written to a text file.

dtSearch.exe, dtSearch Web: Sorting search results by filename did not check, for PDF/HTML files, whether the filename or the Title was currently displayed as the filename.
As a result, the list was always sorted by the title for these file types.

dten600.dll: On a sort by Location, items within a particular location were sorted in ascending order by score, rather than descending order

dten600.dll: Fixed error in WordPerfect file parser character tables for processing some Arabic and Greek characters

dtSearch Web: JavaScript for "Next Hit" button did not find last hit in a document

dten600.dll: Report generated by dtsSearchWantHitDetails flag had hits in descending instead of ascending order

dtSearch 6.03 (Build 6079) Released June 1, 2001

Enhancements (dtSearch Desktop)

dtSearch.exe can run from a shared network folder (see the help file for details)

Much faster loading of long documents

Loading of long documents can be cancelled

Option setting not to automatically open the first document

Hit highlighting shows which hit is currently selected

Edit|Copy File added

New UTF-8 file type in File Types dialog box

User-defined fields can be included in search reports and search listings

New dtsrun.exe launcher to select dtsearch.exe or dtsearchw.exe automatically

Enhancements (dtSearch Engine)

New CD Wizard for setting up dtSearch Web to run from a CD. [This new option for CD publishing
will not be released with version 6.03 due to the need for more beta testing. It is included in
pre-release form for testing and evaluation purposes only.]

dtSearch 6.02b (Build 6055) Released April 9, 2001

dtSearch 6.02 (Build 6055) Released April 2, 2001

dtsearch.exe: SPACE and BACKSPACE do not work in HtmlHelp when launched from dtSearch. The reason for this is that
dtSearch hotkeys are still active when help is launched, and in dtSearch SPACE = Next Hit and BACKSPACE = prev hit.

dtsearch.exe: Search dialog box, under Windows 98, does not move the cursor to the end of the search request when the user clicks AND/OR/etc. buttons

dtsearch.exe: "Include word counts" checkbox in List Index dialog box did not work

dtsearch.exe: search dialog box warns when unindexed searching is left on, and when file filters are left over from a previous search under More Search Options. The warnings can be suppressed with a checkbox, and there is a new Options|Enable warnings... command to enable all optional warnings.

dtsearch.exe: Change to booleanConnectors setting did not work if boolean connectors string was too long

dtsearch.exe: "Search in a new window" did not do anything under Windows 9x

dtindexer.exe: warns when the "Clear index before adding documents" box is checked and the index already has data (like 5.25 did -- people have been asking why dtSearch fully rebuilds their indexes each time)

dten600.dll: Word documents created by a Japanese version of Word 95 were not recognized

dten600.dll: HtmlTitle field sometimes not generated for contents of the HTML <TITLE>

dten600.dll: crash indexing a corrupt WordPerfect document

dten600.dll: SearchReportJob sometimes put a line break between the beforeHit mark and the first letter of the hit word