POSIDEX
Posidex Data Quality System (PDQS)

Posidex Data Quality System (PDQS) is a product suite that provides a comprehensive end-to-end deduplication solution for enterprise-class database systems. Deduplication is the process of systematically identifying duplicate records of the same customer despite errors and variations in each parameter. Within PDQS, the duplicate records are processed to produce a master database that holds a unique record for every customer. PDQS forms the foundation for advanced solutions such as clustering and building a complete customer profile from partial database environments. It searches despite errors, variations, and duplications, delivering the highest possible reliability when searching, matching, screening, or grouping data based on names, addresses, descriptions, and other identification parameters.

PDQS offers a sophisticated yet easy-to-use graphical interface. It is built on top of our innovative search technology, which scales to millions of records in seconds and delivers high reliability and accuracy, with matches that have fewer false positives.

PDQS functionalities
  • Cleansing/Standardization
  • Deduplication with bulk and sequential processors
  • Merge
  • Search and Match
  • Configuration and Management Tools
Cleansing/Standardization

PDQS performs standardization by applying well-defined matching rules to the raw consolidated data. Standardization is a full-fledged implementation with built-in default data dictionaries as well as customizable ones for standardization rules. At a minimum, there are over 20,000 combinations of variations for standardizing the data, depending on geography, demographics, and local customs. For instance, miscellaneous data entries such as titles (Mr., Mrs., Ms., etc.) and address details (attn:, s/o, w/o) are removed. Abbreviations in names, such as Md. for Mohammad, are expanded or replaced with suitable data. Depending on the business case, users can easily add, modify, and remove rules.
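To make the rule-driven idea concrete, here is a minimal Python sketch of dictionary-based standardization. It is an illustration only, not the PDQS implementation: the dictionaries `TITLES`, `ADDRESS_MARKERS`, and `ABBREVIATIONS` and the function `standardize` are hypothetical names holding just the handful of examples mentioned above.

```python
# Hypothetical rule dictionaries for illustration; the real PDQS
# dictionaries are far larger and are not shown in the source.
TITLES = {"mr", "mrs", "ms"}
ADDRESS_MARKERS = {"attn", "s/o", "w/o"}
ABBREVIATIONS = {"md": "mohammad"}


def standardize(value: str) -> str:
    """Lowercase the value, drop titles and address markers, expand abbreviations."""
    tokens = []
    for raw in value.lower().split():
        token = raw.strip(".,:;")  # remove punctuation around the token
        if not token or token in TITLES or token in ADDRESS_MARKERS:
            continue  # entries matched by a removal rule are discarded
        tokens.append(ABBREVIATIONS.get(token, token))
    return " ".join(tokens)


print(standardize("Mr. Md. Irfan Khan"))      # mohammad irfan khan
print(standardize("Attn: S/o Ramesh Kumar"))  # ramesh kumar
```

Because the rules live in plain dictionaries, adding, modifying, or removing a rule amounts to editing an entry, which mirrors the customization described above.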

Deduplication with bulk and sequential processors

Unlike other solutions, where data is searched and matched using a one-size-fits-all model, Posidex has developed different processors, each an innovative search and match technology addressing a different data volume. Customers can choose the processor technology appropriate to their business need.

Deduplication is performed on the standardized data output by the Data Profiler. It can be run on the entire data set or on incremental data sets using innovative search and match technologies, and on live data or offline, in an extremely efficient and reliable manner.
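The difference between a full run and an incremental run can be sketched in a few lines of Python. This sketch swaps in the standard library's `difflib.SequenceMatcher` as a stand-in similarity measure; it is not the Posidex matching technology, and the function names and the 0.85 threshold are assumptions chosen for illustration.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Stand-in similarity score between 0.0 and 1.0 (not the Posidex algorithm)."""
    return SequenceMatcher(None, a, b).ratio()


def add_record(groups: list, record: str, threshold: float = 0.85) -> list:
    """Incremental step: attach one new record to an existing group or start a new one."""
    for group in groups:
        # Compare against the first record of each group as its representative.
        if similarity(group[0], record) >= threshold:
            group.append(record)
            return groups
    groups.append([record])
    return groups


def deduplicate(records: list, threshold: float = 0.85) -> list:
    """Full run: group the entire data set by repeating the incremental step."""
    groups = []
    for record in records:
        add_record(groups, record, threshold)
    return groups


groups = deduplicate(["mohammad irfan khan", "mohammed irfan khan", "ramesh kumar"])
add_record(groups, "ramesh kumaar")  # a record arriving later joins its group
print(groups)
```

Comparing every new record against every group is quadratic in the worst case, so this only illustrates the grouping logic; handling millions of records requires the kind of indexing a dedicated search engine provides.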

Merge

Based on user-defined grouping criteria, the duplicate data is then processed into unique customer data by either a horizontal or a vertical merge, as business needs dictate. In a vertical merge, the admin selects the last updated record and discards the other duplicate records of the same customer. In a horizontal merge, the admin selects partial information from each of the duplicate records to create a unique record.
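The two merge styles can be illustrated with a short Python sketch. The record layout, the `updated` field, and the rule used for the horizontal merge (prefer the newest non-empty value per field) are assumptions made for this example; in PDQS the admin chooses which information to keep.

```python
def vertical_merge(duplicates: list) -> dict:
    """Keep only the most recently updated record of the group."""
    return dict(max(duplicates, key=lambda r: r["updated"]))


def horizontal_merge(duplicates: list) -> dict:
    """Combine fields across the group, preferring newer non-empty values."""
    merged = {}
    # Oldest first, so that newer non-empty values overwrite older ones.
    for record in sorted(duplicates, key=lambda r: r["updated"]):
        for field, value in record.items():
            if value:  # skip empty strings and None
                merged[field] = value
    return merged


# Hypothetical duplicate group; ISO dates sort correctly as plain strings.
group = [
    {"name": "Mohammad Irfan Khan", "phone": "", "city": "Hyderabad", "updated": "2009-01-10"},
    {"name": "Mohammad Irfan Khan", "phone": "5550100", "city": "", "updated": "2009-03-05"},
]
print(vertical_merge(group))    # newest record only; city stays empty
print(horizontal_merge(group))  # phone from the newer record, city from the older
```

The vertical merge is simpler but can lose information that exists only in an older record, which is the gap the horizontal merge fills.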

PDQS allows manual regrouping based on any new information the end user obtains outside the system.

Search and Match

This Search and Match process is part of Identity Resolution. PDQS provides the ability to add users and assign roles for access control and built-in security. Multiple end users can issue queries at the same time for different matches. These searches are highly parallelized and serviced asynchronously. PDQS finds matches across various databases with millions of records at very high speed. The status and results are then sent as output to end users. For administrators, all searches performed by users are listed for review. The details of each query and its results help administrators fine-tune the matches to improve accuracy.

Configuration and Management Tools

PDQS Management has a simple-to-use graphical user interface for managing PDQS. The salient features include user management, auditing and logging, integration, scheduling, and reporting.
  • User management: Supports different user roles and permissions for security. An admin-role user can provide pre-defined as well as customized global configuration parameters for search results. Users can change the configuration of a parameter or a set of parameters on the fly for a search. The results are displayed on the same page so that users can quickly fine-tune local configuration parameters to suit their needs.
  • Auditing and logging: All queries submitted by users are logged for auditing and fine-tuning. An admin-role user can see consolidated queries from all users for global configuration parameters.
  • Reporting: Performed on master data that is free of duplicates, as needed for business requirements.
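The relationship between the admin's global configuration and a user's on-the-fly local changes can be pictured as a layered lookup. The following Python sketch uses the standard library's `collections.ChainMap`; the parameter names `match_threshold` and `max_results` and their values are hypothetical, since the actual PDQS parameters are not listed here.

```python
from collections import ChainMap

# Hypothetical global defaults set by an admin-role user.
GLOBAL_CONFIG = {"match_threshold": 0.85, "max_results": 50}


def effective_config(local_overrides: dict) -> ChainMap:
    """Layer a user's per-search overrides on top of the global defaults."""
    return ChainMap(local_overrides, GLOBAL_CONFIG)


config = effective_config({"match_threshold": 0.90})
print(config["match_threshold"])  # the user's local value
print(config["max_results"])      # falls back to the global default
```

Local overrides never modify `GLOBAL_CONFIG` itself, so one user's tuning does not affect the defaults seen by others.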
Key features of PDQS:
  • Product suite for data quality that provides a complete end-to-end deduplication solution.
  • Management utility for security, flexibility, and performance tuning.
  • Manages data integrity, identifies duplicate records, and prevents duplicates from entering the system.
  • Leverages underlying Posidex search engine technologies for unmatched scalability, reliability, and accuracy at production sites.
  • No expensive additional hardware infrastructure is required. Works with entry-level commodity servers.
  • Can integrate with existing third-party solutions at the customer site, integrating data across heterogeneous database environments.
For more details on PDQS functionalities, please contact us.


Copyright © 2009 Posidex Technologies P Ltd