dtSearch Corp., a leading supplier of enterprise and developer text retrieval and file conversion software, announces Version 7.66 of the dtSearch product line, including a native 64-bit .NET 4 SDK for the dtSearch Engine. The .NET 4 SDK covers the Spider API for indexing local and remote, static and dynamic web-based content, encompassing both public Internet and secure Intranet data. The .NET 4 release also has a sample application for the Microsoft Azure cloud platform. And the new SDK offers performance enhancements for faceted searching involving millions of document metadata tags or database records.
The dtSearch Engine for Win & .NET and the dtSearch Engine for Linux make available dtSearch’s data searching and file format support for use in a wide variety of Internet, Intranet and other commercial applications. (For more than a hundred publicly-available developer case studies, please see www.dtsearch.com/casestudies.html.) The dtSearch Engine has 64-bit and 32-bit .NET, Java and C++ APIs.
In addition to the dtSearch Engine, the new release also spans the rest of the dtSearch product line: dtSearch Web with Spider for quickly publishing instantly searchable data to an Internet or Intranet site, dtSearch Network with Spider for instantly searching across a network, dtSearch Publish for publishing searchable data to portable media, and dtSearch Desktop with Spider for desktop search. Following are key features of the dtSearch product line.
Terabyte Indexer. dtSearch products can index over a terabyte of text in a single index. They can also create and simultaneously search an unlimited number of indexes. Indexed search time is typically less than a second, even across terabytes of data. The new release also offers performance enhancements to a forensic indexing version of the terabyte indexer.
Built-In Proprietary File Parsers / Converters. dtSearch products embed dtSearch’s own file parsers and converters covering a wide range of popular file types. The product line displays existing web-ready content like HTML, PDF and XML/XSL with highlighted hits, and with images, links and formatting intact. The products convert other file types to XML or HTML for browser-based display with highlighted hits. The current version supports MS Office through MS Office 2010 (including current versions of Access, Excel, PowerPoint, and Word), OpenOffice, CSV, ZIP, etc.
Support for Emails and Attachments. The dtSearch product line can also index and search email messages such as Exchange, Outlook and Thunderbird, as well as attachments, nested attachments, and zipped attachments.
Spider. The dtSearch Spider adds local or remote website content to a searchable data collection. The Spider provides integrated hit-highlighted searching of dynamic content like MS SharePoint, CMS, PHP and ASP.NET, as well as static content like HTML, PDF, XSL/XML. Searches can span any level of depth, across any number of sites, including public sites, intranets, and secure and password-accessible sites. A .NET Spider API (including the new .NET 4 version) makes the Spider functionality accessible to programmers.
Cloud Applications. The dtSearch Engine has been used for years in a variety of third-party SaaS-hosted applications. The new version includes sample .NET 4 code for the MS Azure cloud platform.
25+ Search Options; International Languages. dtSearch offers over 25 full-text and fielded data search options, including advanced federated search options and special forensics features. Unicode support covers all Unicode languages, including support for right-to-left languages and Chinese/Japanese/Korean character processing options.
Faceted Searching; Database and Other Data Classification Objects. The dtSearch Engine offers a wide range of API filters and objects to support categorization via document full-text contents, document fields, database contents, or data attributes attached during indexing. The new SDK enhances hierarchical sorting or “faceted search” performance across data sets with millions of metadata tags.
For faceted searching, the metadata tags themselves may reside as fields inside of existing documents, as fields added “on the fly” during indexing, or as fields in a structured database like XML or SQL. For database searching, the dtSearch Engine supports indexing documents referenced by a database, as well as documents stored as BLOB data inside the database.