ΟΙ ΣΥΝΕΡΓΑΤΕΣ ΠΟΥ

ΣΥΜΒΑΛΛΟΥΝ ΕΝΕΡΓΑ ΣΤΗΝ ΕΠΙΤΥΧΙΑ



STATISTICA Data Miner


Highlights: Unique Features of STATISTICA Data Miner

 

Request a trial

Discover how we can put
your data to work for you.

Rexer 2013 Satisfaction Chart

STATISTICA maintains
high marks in 2013
Rexer Data Miner Survey

  • Data Mining with STATISTICA offers a wealth of options and techniques not available in competing products. These features can be critical to maximize ROI in a competitive environment.
  • STATISTICA Data Miner can be used by novices, and even offers automatic model builder and wizard-like "Data Miner Recipes," yet offers the most comprehensive selection of methods and techniques for experts to solve even the most complex problems
  • STATISTICA Data Miner is the most versatile data mining tool available, giving you all the right tools to gain critical business/process insights quickly, and to act on those insights (“deploy”) for instant ROI
  • STATISTICA Live Score®, an optional data mining tool, provides an efficient method of deploying data mining models.
  • STATISTICA Data Miner is optimized for use with large data files and with the latest version has been enhanced further to speed computation time and improve scalability and performance.  The tools process extremely large data files efficiently by extensive use of multithreading. (see benchmark comparisons)

Overview

  • The most comprehensive and effective system of user-friendly tools for the entire data mining process - from querying databases to generating final reports.
  • To the best of our knowledge, STATISTICA Data Miner contains the most comprehensive selection of data mining methods available on the market; e.g., by far the most comprehensive selection ofclustering techniques, neural networks architecturesclassification/regression trees (also called recursive partitioning methods), multivariate modeling (including MARSplinesSupport Vector Machines), association and sequence analysis (an optional add-on), and many other predictive techniques; even methods for advanced/true simulation and optimization of models are provided
  • STATISTICA Data Miner also provides the largest selection of graphics and visualization procedures of any competing products, to enable effective data exploration and visual data mining
  • STATISTICA Data Miner can process, read, and write data from virtually all standard file formats, can directly access and process/score databases (even without performing explicit import/export operations), and it can import and export files from legacy (or competing) products
  • STATISTICA Data Miner also provides the most effective data pre-processing, cleaning, and filtering tools for effective feature selection from among thousands (or even a million) of candidate predictors, automatic optimal binning, options to merge multiple data sources, align data based on multiple criteria including time-stamps at unequal intervals (data aggregation), deal with missing data, remove duplicate records, outliers, etc.
  • STATISTICA Data Miner provides effective wizards (Data Miner Recipes) to get to useful results solutions quicklySTATISTICA Data Miner also provides the familiar workspace drag-and-drop interface to create custom workflows, allows interactive detailed drill down into specific intermediate and final results, and is fully programmable and customizable
  • STATISTICA Data Miner can produce prediction models in various formats, including PMML, C++ (C#), Java and other common programming/scoring languages (e.g., SAS, stored database procedures); theSTATISTICA Rapid Deployment engine (included) allows you to move directly from modeling to deployment and scoring of live data, data bases, etc.
  • STATISTICA Data Miner is fully integrated into the STATISTICA line of products, e.g., can be used for process optimization and advanced model-based process monitoring, automatic scoring of live data using STATISTICA Enterprise, etc.
  • STATISTICA Data Miner can be used as a desktop application or run in a powerful client-server architecture (for server-based parallel processing of multiple analyses, with load balancing to manage large numbers of users, and options for scheduled batch tasks/processing)