dtSearch Corp., a leading supplier of document filters and general enterprise and developer text retrieval software for instantly searching terabytes of data, announces version 7.73 of its product line. The new version enhances .NET, C++ and Java API options for dtSearch Engine developers embedding the document filters for data parsing, conversion, extraction, and display of retrieved data with highlighted hits.
Available with the present release, a new beta version of dtSearch Web with Spider provides HTML5 template enhancements for publishing instantly searchable data to an Internet or Intranet site. dtSearch Web works as an easy, no-programming-required solution for enabling instant, concurrent searching across terabytes of static and dynamic online data, with highlighted hits and dozens of other search options.
Document filters overview. dtSearch’s proprietary document filters support a broad range of data types. The document filters cover parsing, indexing and searching of retrieved full-text and metadata. Support also covers display of metadata and full-text data with highlighted hits. (Typically, dtSearch does this following dtSearch’s own automatic, built-in conversion of the data to HTML.) In many cases, the document filters also support integrated image display along with highlighted hits.
Supported data types. dtSearch support covers full-text and metadata display with highlighted hits; where indicated, support also covers integrated images along with text.
• Web-ready static and dynamic content: support covers integrated image and text support in HTML, XML/XSL, PDF, ASP.NET, PHP, SharePoint, etc.
• Other databases: support covers XML, Access, XBASE, CSV, etc.; dtSearch Engine APIs support SQL-type data along with the full-text of BLOB data.
• MS Office formats: support covers integrated browser-ready image and text support in Word (RTF/DOC/DOCX), PowerPoint (PPT/PPTX), Excel (XLS/XLSX), Access (MDB/ACCDB) and OneNote (ONE).
• PDF, other “Office” documents, compression formats: support covers PDF with integrated image and text support, OpenOffice, RAR, ZIP, GZIP/TAR, etc.
• Emails and attachments: support covers integrated browser-ready image and text support—plus support for attachments—in Outlook/Exchange (PST/MSG) and Thunderbird (MBOX/EML).
• Recursively embedded objects: support covers recursively embedded objects and images in supported email types and MS Office formats. For example, the dtSearch document filters would support an email attachment consisting of a ZIP container including both a PDF and an Access database, where the latter also includes an embedded PowerPoint with embedded images.
Terabyte Indexer. dtSearch enterprise and developer products can index over a terabyte of text in a single index, spanning multiple directories, emails and attachments, online data and other databases. The products can create and search any number of indexes. Indexed search time is typically less than a second, even across terabytes of data. The product line also supports highly concurrent, multithreaded searching for online and other shared access repositories.
Federated Searching and the dtSearch Spider. dtSearch products offer federated searching across any number of directories, emails (with nested attachments), and databases. The dtSearch Spider adds local and remote, static and dynamic online content to a search. The Spider can index sites to any level of depth, with support for public and private or secure online content, including log-ins and forms-based authentication. dtSearch products support integrated relevancy ranking with highlighted hits across both online and offline data repositories.
Faceted Search and Other Data Classification Options. The dtSearch Engine supports categorization based on document full-text contents, internal document metadata, database content, or data attributes associated with documents during document indexing. The dtSearch Engine has APIs for other advanced data classification options as well, such as faceted search and full-text and/or fielded data positive and negative variable term weighting.
25+ Search Options and International Language Support. The dtSearch product line
offers 25+ search types, including special forensics search options. dtSearch products provide Unicode support for international language text, including support for right-to-left languages, and special Chinese/Japanese/Korean character options.
Developer SDKs. The dtSearch Engine for Win & .NET and the dtSearch Engine for Linux make available dtSearch instant searching and document filters (both together with searching as well as available for separate licensing) for a wide range of Internet, Intranet and other commercial applications. SDKs include native 64-bit and 32-bit C++, Java and .NET (through current versions) APIs. For over a hundred developer case studies, please see www.dtsearch.com/casestudies.html.