Get the data download.

Spectrum Data Discovery Module

Get the insight you need before beginning your data integration, data quality, data governance or master data management projects. Data discovery is the process of scanning data resources to get a complete inventory of your data landscape.

Data discovery software
The Spectrum™ Data Discovery Module can scan structured data unstructured data and semi-structured data using a wide array of data profiling techniques. The results of the scan are used to automatically generate a library of documentation describing your company's data assets and to create a metadata repository.

This documentation and accompanying metadata repository provide the insight you need before beginning data integration data quality data governance or master data management projects.

Our data discovery layer provides companies with a broad set of capabilities to scan and inventory their data landscape. Using these applications organizations can perform the following:

  • Scan data environment (structured data unstructured data semi-structured data)
  • Create and populate a Metadata Repository that can be searched by business and technical users

Our profiling layer equips you with comprehensive data profiling abilities that are simple to understand use and maintain while running your desired tasks.

Key Features of our Data Discovery Software
The Spectrum™ Data Discovery Module offers a number of outstanding features to help you gain better insight and optimize your data assets.

Scan and inventory your data landscape
Use our data discovery capabilities to perform the following tasks:

  • Scan your structured unstructured semi-structured data environment
  • Create and populate a Metadata Repository that can be searched by business and technical users
  • Securely connect to different structured databases across your data landscape
  • Automate the discovery of metadata contained in relational databases and create a comprehensive catalog of enterprise metadata essential for enterprise data integration
  • Scan domains in code files and identify table and column names of selected schemas used in content of scanned code files
  • Scan discover and catalog information contained in a variety of file types including Microsoft Access database fixed length delimited Excel PDF document PPT Plain Text RTF and XSD


Our profiling functions make the following tasks simple to understand use and maintain:

  • Discover and catalog undocumented metadata associated with your global data landscape and build an integrated Enterprise Metadata Repository
  • Identify documented and undocumented ID columns and natural key columns composite keys and multi-column IDs
  • Identify documented inferred and user defined relationships
  • Identify and profile subtypes in a table in an enterprise data landscape
  • Analyze data in delimited files get error value suggestions and catalog them
  • Profile data stored in files without loading them in a database with support for the following file types: flat file (.txt) EBCDIC file (.txt) SAS data file (.sas7bdat) comma delimited (.csv and .txt) pipe delimited (.txt) tab delimited (.tsv & .tab) and XML
  • Analyze data in Access Excel (.xls and .xlsx) and XML files get error value suggestions and catalog them
  • Analyze data and get outlier value suggestions based on time period of occurrence