Chapter IX.Data quality and metadata
A.Quality and its dimensions
9.1Quality. Energydata made available to users are the end product of a complex process comprising many stages,including the collection of data from various sources, data processing,data formatting to meet user needs and finally, data dissemination. Achieving overall data quality is dependent upon ensuring quality in all stages of the process. Data quality is assessed based on whether or not usersare provided with information adequate for their intended use. In other words, the quality is judgedby its“fitness for use.” For example, users must be able to verify that the conceptual framework and definitions that would satisfy their particular needs are the same as, or sufficiently close to, those employed in collecting and processing the data. Users mustalso be able to assess the degree to which the accuracy of the data is consistent with their intended use or interpretation. All the measures that responsible agencies take to assure data quality constitute quality management. Countries are encouraged to develop their own national energy data quality management programmes, to document these programmes, to develop measures of data quality, and to make these available to users.
9.2Data quality assessment frameworks. Most international organizations and countries have developed general definitions of data quality, outlining the various dimensions (aspects) of quality and quality measurement and integrating them into quality assessment frameworks.[1] Although the existing quality assessment frameworks differ to some extent in their approaches to quality and in the number, name and scope of quality dimensions (see figure VIII.1), they complement each other and provide comprehensive and flexible structures for the qualitative assessment of a broad range of statistics, including energy statistics. For example:
(a)The IMF Data Quality Assessment Framework (DQAF) takes a holistic view of data quality and includes governance of statistical systems, core statistical processes and statistical products. The Framework is organized as a cascading structure covering the prerequisites and five dimensions of quality: assurance of integrity, methodological soundness, accuracy and reliability, serviceability and accessibility;
(b)The European Statistical System (ESS) focuses more on statistical outputs and defines the quality of statistics with reference to six criteria: relevance, accuracy, timeliness and punctuality, accessibility and clarity, comparability and coherence;
(c)The OECD quality measurement framework views quality as a multifaceted concept. As with the Eurostat approach, the quality characteristics depend on user perspectives, needs and priorities, which vary across groups of users. Quality is viewed in terms of seven dimensions: relevance, accuracy, credibility, timeliness, accessibility, interpretability and coherence.
9.3The relationship between Quality Assessment Frameworks. The overall aim of the these quality assessment frameworks is to standardize and systematize statistical quality measurement and reporting across countries. They allowthe assessmentof national practices in energy statistics in terms of internationally (or regionally) accepted approaches for data quality measurement. The quality assessment frameworks could be used in a number of contexts, including for (a) guiding countries’ efforts towards strengthening their statistical systems by providing a self-assessment tool and a means of identifying areas for improvement; (b) technical assistance purposes; (c) reviews of a country’s energy statistics domainas performed by international organizations; and (d) assessments by other groups of data users.
9.4Dimensions of quality. National agencies responsible for energy statistics can decide to implement one of the existing frameworks for quality assessment of any type of statistics, including energy statistics, either directly or by developing, on the basis of those frameworks, national quality assessment frameworks that fit best their country’s practices and circumstances. The following dimensions of quality, which reflect a broad perspective and in consequence have been incorporated in most of the existing frameworks, should be taken into account in developing quality assessment frameworks for measuring and reporting the quality of statistics in general and energy statistics in particular: prerequisites of quality, relevance, credibility, accuracy, timeliness, methodological soundness, coherence and accessibility. They are described in greater detail directly below:
(a)Prerequisites of quality. Prerequisites of quality refer to all institutional and organizational conditions that have an impact on the quality of energy statistics. The elements within this dimension include: the legal basis for compilation of data; adequacy of datasharing and coordination among dataproducing agencies; assurance of confidentiality; adequacy of human, financial, and technical resources for the operation of energy statistics programmes and the implementation of measures to ensure their efficient use; and quality awareness;
(b)Relevance. The relevance of energy statistics reflects the degree to which they meet the real needs of users. Therefore, measuring relevance requires the identification of user groups and their data needs. The responsible agencies should balance the different needs of current and potential users by delivering a programme that goes as far as possible towards satisfying the most important needs of key users in terms of the content of energy data, coverage, timeliness, etc., given resource constraints. Strategies for measuring relevance include tracking requests from users and the ability of the energy statistics programme to respond, conducting users’ satisfaction surveys and studying their results, andconsulting directly with key users about their interests and their views of the gaps and deficiencies in the energy statistics programme;
(c)Credibility.[2]The credibility of energy statistics refers to the confidence that users place in those data based on the reputation of the responsible agencies producing the data. The confidence of users is built over time. One important aspect of credibility is trust in the objectivity of the data, which implies that the data are perceived to be produced professionally in accordance with appropriate statistical standards, and that policies and practices are transparent. For example, data should not be manipulated, withheld or delayed, nor should their release be influenced by political pressure;
(b)Accuracy. The accuracy of energy statistics refers to the degree to which the data correctly estimate or describe the quantities or characteristics that they have been designed to measure. It has many facets and in practice there is no single overall measure of accuracy. In general, it is characterized in terms of errors in statistical estimates and is traditionally decomposed into bias (systematic error) and variance (random error) components. However, it also encompasses the description of any processes undertaken by responsible agencies to reduce measurement errors. In the case of energy estimates based on data from sample surveys, the accuracy can be measured using the following indicators: coverage rates, sampling errors, non-response errors, response errors, processing errors, and measuring and model errors. Regular monitoring of the nature and extent of revisions toenergy statistics are considered a gauge of reliability;
(c)Timeliness. The timeliness of energy statistics refers to the lag between the reference point (or the end of the reference period) to which the information pertains, and the date on which the information becomes available. The timeliness of information will influence its relevance and utility for users, but may result in a trade-off against accuracy. Timeliness is closely tied to the existence of an output release schedule. Such a schedule may comprise a set of target release dates or may entail a commitment to release energy data within a prescribed time period following their receipt.One measure of timeliness is the elapsed timebetween the identified release date and the effective dissemination date of energy data. Another measure would be the extent to which the programme meets its target dates;
(d)Methodological soundness. Methodological soundness is a dimension that encompasses the application of international standards, guidelines and good practices in the production of energy statistics. The adequacy of the definitions and concepts, sample design, frame quality, variables and terminology underlying the data, and the information describing the limitations of the data, if any, largely determines the degree of adherence of a particular dataset to international standards. The metadata provided along with energy statistics play a crucial role in assessing the methodological soundness of data. They inform the users on how close to the target variable (for example, any of the data items) are the input variables used for their estimation. When there is a significant difference, there should be an explanation of the extent to which this may cause a bias in the estimation of data items. Methodological soundness is closely related to the interpretability of data, which depends on all of the features of the information on energy data mentioned above and reflects the ease with which the user may understand and properly use and analyse the data;
(e)Coherence. The coherence of energy statistics reflects the degree to which the data are logically connected and mutually consistent, that is to say, the degree to which they can be successfully brought together with other statistical information within a broad analytical framework and over time. The use of standard concepts, definitions, classifications and target populations promotes coherence, as does the use of a common methodology across surveys. Coherence,which does not necessarily imply full numerical consistency, has four important sub-dimensions:
(i)Coherence within a dataset.This implies that the elementary data items are based on compatible concepts, definitions and classifications and can be meaningfully combined. For energy statistics, this sub-dimension governs the need for all data items to be compiled in conformity with the methodological basis of the recommendations presented in IRES;
(ii)Coherence across datasets.This implies that the data across different datasets are based on common concepts, definitions and classifications. The coherence between energy statistics and other statistics (e.g., economic, environmental) will be ensured if all data sets are based on common concepts, definitions, valuation principles, classifications, etc., and as long as any differences are explained and can be allowed for;
(iii) Coherence over time.This implies that the data are based on common concepts, definitions and methodology over time. This property will be established if, for example, an entire time series of energy data is compiled on the basis of the recommendations in IRES. If this is not the case, it is advisable that countries clearly note the divergences from the recommendations; and
(iv) Coherence across countries.This implies that the data are based on common concepts, definitions and methodology across countries. Coherence of energy statistics across countries may be dependent upon the extent to which the recommendations in IRES have been adopted;
(f)Accessibility. The accessibility of energy statistics refers to the ease with which the data can be obtained from the responsible agencies, including the ease with which the existence of information can be ascertained, as well as the suitability of the form or the media of dissemination through which the information can be accessed. Aspects of accessibility also include costs, the availability of metadata and the existence of user support services. Accessibility requires development of an advance release calendar so that the users will be informed well in advance about when and where the data will be available and how to access them.
9.5Interconnectedness and trade-offs. The dimensions of quality described above are overlapping and interconnected and as such,are involved in a complex relationship. Action taken to address or modify one aspect of quality will tend to affect other aspects. For example, there may be a trade-off between aiming for the most accurate estimation of the total annual energy production or consumption by all potential producers and consumers, and providing this information in a timely mannerwhen it is still of interest to users. It is recommended that if, while compiling a particular energy statistics dataset,countries are not in a position to meet the accuracy and timeliness requirements simultaneously, they should produce a provisional estimate, which would be available soon after the end of the reference period but would be based on less comprehensive data content. [KT1]This estimate would be supplemented at a later date with information based on more comprehensive data content but would be less timely than its provisional version. If there is no conflict between these two quality dimensions, there will of course be no need to producesuch estimates.
9.6Measurement of quality. The measurement of the quality of any statistical data, including energy statistics, is not a simple task. Problems arise from the difficulties involved in quantifying the levels of individual dimensions and in aggregating the levels of all dimensions. Under these circumstances, deriving a single quantitative measure of quality is not possible. In the absence of such a single measure, countries are encouraged to use a system of quality measures/indicators (see section B below),to develop their own quality assessment frameworks based on the abovementioned approaches, taking into consideration the specific circumstances of their economies, and to regularly issue quality reports as part of their metadata. The quality framework offersresponsible agencies a practical approach to providing data that meet different users’ needs, while the provision of quality information allows users to judge for themselves whether a dataset meets their particular quality requirements. It is recommended that a quality review of energy statistics be undertaken every four to five years, or more frequently if significant methodological or other changes in the data sources occur.
B.Quality measures and indicators
9.7Quality measures. Quality measures are defined as those items that directly measure a particular aspect of quality. For example, the time lag from the reference date to the release of particular energy statistics is a direct quality measure of timeliness. However, in practice, many quality measures can be difficult or costly to calculate. In these cases, quality indicators can be used to supplement or act as substitutes for the desired quality measurement.
9.8Quality indicators. Quality indicators can be described as quantitative data that provide evidence about the quality or standard of the data collected by statistical agencies.They are linked to the achievement of particular goals or objectives. Unlike ordinary raw statistics, quality indicators are generally conceptualized in terms of having some reference point and, so structured, can assist in making a range of different types of comparisons.
9.9Quality indicators as indirect measures.Quality indicators usually consist of information that is a by-product of the statistical process. They do not measure quality directly but can provide enough information for the assessment of a quality. For example, in respect to accuracy, it is very challenging to measure non-response bias as the characteristics of non-respondents can be difficult and costly to ascertain. In this instance, the response rate is often utilized as a proxyto provide a quality indicator of the possible extent of non-response bias. Other data sources can also serve as a quality indicator to validate or confront the data. For example, in the energy balances, energy consumption data can be compared with production figures to flag potential problem areas.[KT2]
9.10Selection of quality measures and indicators. It is not intended that all quality dimensions should be addressed for all data. Instead, countries are encouraged to select those quality measures/indicators that together provide an assessment of the overall strengths, limitations and appropriate uses of a given dataset. Certain types of quality measures and indicators can be produced for each data item. For example, response ratesfor total energy production can be calculated and disseminated with each new estimate. Alternatively, other measures could be produced once for all data items and would be rewritten only if there were changes. The latter case is exemplified in the description of survey approaches to data collection for the quality dimension “methodological soundness”, which would be applicable to all energy statistics data items.
9.11Defining quality indicators. When countries define the quality indicators for energy statistics,it is recommended that they ensure that the indicators satisfy the following criteria: (a) they cover part or all of the dimensions of quality as defined previously; (b) the methodology for their compilation is well established; and (c) the indicators are easy to interpret.
9.12Types of quality indicators. Quality indicators can be classified according to their importance as follows:
(a)Key indicators,which ought to fulfil the criteria given in paragraph9.11. Two examples of key quality indicators are: the coefficient of variation which measures the accuracy of energy statistics obtained through sample surveys, and the elapsed time between the end of the reference period and the date of the first release of data, which measures the timeliness of energy statistics;
(b)Supportive indicators,which fulfil the criteria in paragraph 9.11 to the extent that they are considered important as indirect measures of the data quality. Such an indicator, for example, is the average size of revisions undertaken between the provisional and final estimates of a particular dataset, which is an indicator of the accuracy of energy statistics;
(c)Indicators for further analysis,which are subject to further examination and discussion on the part of responsible agencies. After a careful analysis of the responsible agencies capabilities and available resources, for example, some countries may decide to conduct a user satisfaction survey and calculate a user satisfaction index for measuring the relevance of energy statistics.
9.13Balance of indicators. It is recommended that careful attention be paid by countries to maintaining an appropriate balance between different dimensions of quality and the number of indicators. The objective of quality measurement is to have a practical set (limited number) of indicators which can be used to monitor over time the quality of the energy data produced by the responsible agencies and to ensure that users are provided with a useful summary of overall quality, while not overburdening respondents with demands for unrealistic amounts of metadata.