Verifying ESG Metrics Through Evidence, Recalculation, and Site Procedures

How practitioners test ESG metrics using source records, recalculation, sampling, site visits, benchmarking, and data-control procedures.

ESG metrics can be quantitative, qualitative, historical, estimated, entity-specific, or derived from third-party information. The assurance challenge is to connect each reported metric to suitable criteria, a defined reporting boundary, reliable source data, and procedures that support the conclusion.

For CPA exam purposes, treat ESG metrics like any other subject matter: identify what is being measured, understand how management calculated it, test the data and controls, evaluate estimates and assumptions, and consider whether the disclosure is complete and not misleading.

    flowchart LR
	    A["Reported ESG metric"] --> B["Criteria and boundary"]
	    B --> C["Source data and controls"]
	    C --> D["Verification procedures"]
	    D --> E["Exceptions and adjustments"]
	    E --> F["Supported assurance conclusion"]

Common ESG Metric Types

ESG metrics vary widely, but the verification questions are consistent: What is the metric? What does it include? What does it exclude? What evidence supports it?

Metric type Examples Verification focus
Environmental Scope 1 emissions, Scope 2 emissions, water usage, waste diverted, energy intensity Meters, fuel records, utility bills, emission factors, facility boundary, calculation model
Social Employee turnover, safety incidents, training hours, workforce demographics, supplier audit results HR records, incident logs, policy definitions, population completeness, privacy limits
Governance Board independence, ethics training completion, hotline cases, anti-corruption controls Governance records, training systems, case logs, policy evidence, approval trails

Environmental measures often involve calculations and conversion factors. Social measures often involve population completeness and definition consistency. Governance measures often involve whether policies, oversight, and evidence support the reported claim.

Verification Procedures

Verification procedures should respond to the metric’s risk. A high-volume, estimate-based emissions metric requires different work than a board composition disclosure.

Common procedures include:

  1. Inspection: compare reported data to source records such as utility bills, fuel invoices, HR reports, training logs, safety records, or board minutes.
  2. Recalculation: recompute metrics using management’s source data, formulas, and approved conversion factors.
  3. Sampling: select transactions, facilities, months, employees, suppliers, or incidents for detailed testing.
  4. Observation or site visit: observe processes, meters, waste handling, safety conditions, or facility controls.
  5. Inquiry: ask responsible personnel about collection methods, review controls, unusual changes, and limitations.
  6. Analytical procedures: compare current-period metrics to prior periods, production levels, headcount, facility changes, or industry expectations.
  7. Control testing: evaluate whether data inputs, approvals, system access, and change controls prevent or detect errors.

No single procedure proves every metric. The practitioner should combine procedures to address completeness, accuracy, occurrence, cutoff, classification, and presentation risks.

Recalculation and Emissions Metrics

Many emissions metrics are calculated rather than directly observed. A simplified emissions calculation is:

[ \text{Emissions} = \text{Activity Data} \times \text{Emission Factor} ]

Activity data may be gallons of fuel, kilowatt-hours, miles traveled, tons of material processed, or other measurable activity. The emission factor converts the activity into an emissions estimate under the stated criteria.

When recalculating emissions, the practitioner should evaluate:

  • whether activity data is complete for the reporting boundary
  • whether the correct period was used
  • whether conversion factors are current and appropriate
  • whether units were converted correctly
  • whether estimates and assumptions are documented
  • whether manual adjustments were reviewed and approved

The exam trap is assuming that a precise-looking emissions number is reliable. A calculated metric can be wrong if the activity data is incomplete, the factor is outdated, the units are inconsistent, or the reporting boundary excludes relevant facilities.

Scope 1, Scope 2, and Scope 3

Greenhouse gas reporting commonly distinguishes Scope 1, Scope 2, and Scope 3 emissions.

Scope Meaning Evidence challenge
Scope 1 Direct emissions from owned or controlled sources Completeness of fuel, equipment, vehicle, and facility data
Scope 2 Indirect emissions from purchased electricity, steam, heating, or cooling Completeness of utility data and correct location- or market-based factors
Scope 3 Other indirect value-chain emissions Reliance on suppliers, estimates, models, and external data

Scope 3 is often the hardest to verify because it may depend on supplier data, spend-based estimates, product-use assumptions, transportation data, or third-party models. A practitioner should consider whether the disclosure explains methodology and limitations clearly enough for users.

Data Controls and Completeness

Controls over ESG data are often less mature than controls over financial reporting. Data may be collected by operations, facilities, human resources, legal, procurement, or sustainability teams before finance reviews it.

Useful control questions include:

  • Is the metric owner identified?
  • Is the reporting boundary documented?
  • Are source systems complete and access-controlled?
  • Are manual spreadsheets reviewed?
  • Are formula changes approved?
  • Are unusual period-over-period changes investigated?
  • Are third-party data submissions validated?
  • Are assumptions and estimates retained in workpapers?

Completeness is frequently the hardest assertion. The practitioner may need to reconcile facilities, employees, vehicles, suppliers, or meters to a master population before testing the metric.

Benchmarking and Reasonableness

Benchmarking does not replace source testing, but it can identify unexpected relationships. For example, emissions may be compared with production volume, energy expense, facility square footage, headcount, mileage, or prior-year activity.

Unexpected changes require follow-up. A decrease in emissions may be valid because a facility closed, renewable energy was purchased, or production changed. It may also signal missing data, a changed boundary, or an incorrect conversion factor.

Common Pitfalls

  • Testing only the calculation without testing source-data completeness.
  • Accepting supplier-provided Scope 3 data without evaluating reliability.
  • Ignoring unit conversions and emission-factor versions.
  • Treating software output as evidence without testing inputs and controls.
  • Failing to distinguish measured data from estimated data.
  • Reporting benchmarking results as proof instead of a reasonableness check.

Quick Review

ESG metric verification is evidence work. The practitioner identifies the metric, criteria, and boundary; evaluates source data and controls; performs procedures such as inspection, recalculation, sampling, observation, inquiry, and analytics; then considers whether exceptions affect the assurance conclusion.

Review Questions

### Which procedure best describes recalculation of an emissions metric? - [ ] Asking management whether the metric looks reasonable. - [x] Recomputing emissions from activity data and the applicable emission factor. - [ ] Reading a sustainability report for grammar. - [ ] Comparing the company's logo to peers. > **Explanation:** Recalculation independently recomputes the metric using source data and the relevant formula or factor. ### What is the formula relationship commonly used for emissions estimates? - [x] Emissions equal activity data multiplied by an emission factor. - [ ] Emissions equal net income divided by market capitalization. - [ ] Emissions equal headcount multiplied by revenue. - [ ] Emissions equal assets minus liabilities. > **Explanation:** Many emissions measures convert activity data into emissions using emission factors. ### Which item is normally a Scope 1 emissions source? - [ ] Purchased electricity. - [x] Fuel burned in company-owned equipment. - [ ] Supplier manufacturing emissions. - [ ] Customer product use. > **Explanation:** Scope 1 covers direct emissions from owned or controlled sources. ### Why is Scope 3 often harder to verify than Scope 1? - [ ] Scope 3 is always directly metered by the company. - [ ] Scope 3 never uses estimates. - [x] Scope 3 often depends on suppliers, models, assumptions, and external data. - [ ] Scope 3 is limited to board minutes. > **Explanation:** Value-chain emissions often involve third-party data and estimation uncertainty. ### What is a common completeness procedure for ESG metrics? - [ ] Test only the final spreadsheet total. - [x] Reconcile facilities, meters, employees, vehicles, or suppliers to a master population. - [ ] Ignore excluded locations. - [ ] Use only management's verbal explanation. > **Explanation:** Completeness testing often starts by validating the population included in the metric. ### What is the main limitation of benchmarking? - [ ] It is never useful. - [x] It identifies anomalies but does not replace direct source testing. - [ ] It proves all reported metrics are accurate. - [ ] It can only be used for governance metrics. > **Explanation:** Benchmarking is a reasonableness check that may identify items requiring further work. ### Which evidence is most relevant to employee turnover verification? - [ ] Utility bills. - [ ] Fuel emission factors. - [x] HR records showing hires, terminations, and employee population. - [ ] Board independence charters only. > **Explanation:** Employee turnover should be supported by HR source records and population data. ### What should the practitioner do when ESG software produces a final metric? - [ ] Accept the output without testing. - [x] Evaluate inputs, formulas, access controls, changes, and review procedures. - [ ] Treat the software vendor as the assurance provider. - [ ] Exclude the metric from all documentation. > **Explanation:** Software output depends on data inputs, configuration, controls, and review. ### A sharp decrease in emissions compared with production volume should lead the practitioner to: - [ ] assume management achieved the target. - [x] investigate whether the change is supported by valid operational changes or indicates missing data or formula issues. - [ ] delete the comparison from the workpapers. - [ ] switch from reasonable assurance to no assurance automatically. > **Explanation:** Unexpected relationships require follow-up and evidence. ### ESG metric verification may require inspection, recalculation, sampling, observation, inquiry, analytics, and control testing. - [x] True. - [ ] False. > **Explanation:** ESG assurance uses familiar evidence procedures adapted to nonfinancial metrics and criteria.
Revised on Monday, June 15, 2026