ECS's Laveena Munsamy writes about data quality and how it is key to making smarter decisions

Becoming Data-Driven Requires A Data Quality Strategy

As businesses in energy, mining, and industrial sectors increasingly turn to digital transformation strategies - Machine Learning, Artificial Intelligence, real-time analytics - the phrase “data-driven” has become more than a buzzword. But here’s the truth: you can’t become data-driven without a clear data quality strategy, writes ECS's Laveena Munsamy, Technical Expert: Data Transformation and Analysis.

Gartner defines data quality as the usability and applicability of data used for an organisation’s priority use cases [1] – simply: is the data fit for its intended purpose?

Data quality is just one aspect of an organisation’s data management strategy. However, its importance is often overlooked and regarded as a mere technical detail.

Data is essential to how an organisation operates and is the foundation of digital transformation strategies such as Machine Learning and Artificial Intelligence. As organisations progress towards becoming data-driven, data quality becomes a critical and necessary requirement.

The lack of data quality in an organisation results in data and analysis errors, bad operational or strategic decisions, manual data handling, long data access times, and long and costly project implementation, thus impacting customer experience, reputation, regulatory risks and opportunity for new markets [2].

With suitable levels of data quality, organisations can improve operational efficiency, make informed decisions, and improve market concentration. Data quality is typically indicated by data quality dimensions.

Gartner states the nine commonly used data quality dimensions as Accessibility, Accuracy, Completeness, Consistency, Precision, Relevancy, Timeliness, Uniqueness and Validity [1].

Other data quality dimensions have also been proposed for use, including Compliance, Confidentiality, Credibility, Currentness, Efficiency, Integrity, Portability, Traceability, Understandability, and Recoverability [3].

There are various software tools available for data quality monitoring, or organisations can choose to develop custom solutions.

Gartner states the nine commonly used data quality dimensions as Accessibility, Accuracy, Completeness, Consistency, Precision, Relevancy, Timeliness, Uniqueness and Validity

It is important that each organisation understands what the acceptable level of data quality is that is required for its operations, services and products, how it can be measured and what actions can be taken to improve it.

The meaning of data quality varies across different industries and individual organisations, as well as the use case or objective of concern.

Example 1

A manufacturing process company is likely to focus on accuracy if the amount of product is the case. Incorrect measurements can result in insufficient product for sales directly impacting revenue and customer trust.

Here, the accuracy of the product volume can be measured by applying profiling analysis or completing verification analysis of random samples. Actions for improvement include ensuring that measurement systems are functioning and maintained.

Example 2

An e-commerce company is likely to focus on timeliness if product inventory levels are the case. Incorrect inventory levels can result in overselling products, impacting financial operational activities, and customer satisfaction.

Timeliness can be measured via the time difference between inventory levels updating and purchase activity. Actions for improvement include optimising the systems’ update rules.

Implementation of data quality strategies can quickly become overwhelming and costly. This has potentially served as a hindrance to data quality strategies being adopted.

However, to realise the full benefit of becoming data-driven and successfully implementing digital strategies, organisations must embark on a data quality journey.

It is recommended that organisations apply a practical approach to data quality – identify the critical business use cases for data quality, implement data-appropriate quality metrics, and then take necessary actions for improvements.

References

1. Data Quality: Why It Matters and How to Achieve It (https://www.gartner.com/en/dataanalytics/topics/data-quality)

2. Quality Management in Data Governance (https://www.deloitte.com/ce/en/services/consulting/perspectives/bg-qualitymanagement-in-data-governance.html)

3. A Framework for Current and New Data Quality Dimensions: An Overview (2024, Miller, R.; Whelan, H.; Chrubasik, M.; Whittaker, D.; Duncan, P.; Gregorio, J.)

FIELD SERVICE AUTOMATION MANAGEMENT

SmartOSR Service

TIME SCALES: Review

OBJECTIVE: Prioritisation Value & Opportunity Potential/Scoping

FOCUS:

  • Initiative & Project Technical Review for Energy and Carbon
  • Key Value Driver Review
  • Initiative & Project Prioritisation and Ranking by Type and Impact
  • Abatement Cost Profiling

S-ARID Solution

TIME SCALES: Continuous/Internal

OBJECTIVE: Energy Management

FOCUS:

  • IIoT & Sensors
  • Fuel Energy (Traditional)
  • Fuel Energy (Green & Renewable)
  • Electrical Energy (Grid Traditional)
  • Electrical Energy (Local & Grid Renewable)

SmartEPS Solution

Time Scales: Long

Objective: Strategic

FOCUS:

  • Global Simulations and Assessments
  • Target Achievement Simulation
  • Scope 1,2 & 3 Emissions
  • Carbon Abatement Assessments
  • Project and Technology Option Assessments
  • Prescription Analytics
  • Performance Tracking (Long-Term)
  • Life of Operations Greenhouse Gas Assessments

SmartRoad Solution

Time Scales: Continuous/Internal

Objective: Energy Measurement

Focus:

  • IIoT & Sensors
  • Fuel energy (traditional)
  • Fuel energy (green & renewable)
  • Electrical energy (grid traditional)
  • Electrical energy (local & grid renewable)

SmartFEMS Solution

Time Scales: Continuous/Internal 

Objective: Energy measurement

Focus: 

  • IIoT & sensors
  • Fuel energy (traditional)
  • Fuel energy (green & renewable)
  • Electrical energy (grid traditional)
  • Electrical energy (local & grid renewable)

SmartDSM Solution

Time Scales: Continuous/Internal 

Objective: Energy Management

Focus:

  • IIoT & Sensors
  • Fuel Energy (Traditional)
  • Fuel Energy (Green & Renewable)
  • Electrical Energy (Grid Traditional)
  • Electrical Energy (Local & Grid Renewable)

PEMS Solution

TIME SCALES: Short /Medium 

OBJECTIVE: Early assessment of actual performance

FOCUS:

  • Short Interval Control
  • Automated Reporting
  • Performance Verification
  • Predictive Analysis
  • Diagnostic Analysis
  • General Prescriptive Analysis
  • Unit Operations Simulation
  • Demand Profiling
  • Statutory Emissions Reporting Support
  • Feeds into other Reporting Systems