Verifying ESG Metrics Through Evidence, Recalculation, and Site Procedures
Feb 7, 2025
How practitioners test ESG metrics using source records, recalculation, sampling, site visits, benchmarking, and data-control procedures.
On this page
ESG metrics can be quantitative, qualitative, historical, estimated, entity-specific, or derived from third-party information. The assurance challenge is to connect each reported metric to suitable criteria, a defined reporting boundary, reliable source data, and procedures that support the conclusion.
For CPA exam purposes, treat ESG metrics like any other subject matter: identify what is being measured, understand how management calculated it, test the data and controls, evaluate estimates and assumptions, and consider whether the disclosure is complete and not misleading.
flowchart LR
A["Reported ESG metric"] --> B["Criteria and boundary"]
B --> C["Source data and controls"]
C --> D["Verification procedures"]
D --> E["Exceptions and adjustments"]
E --> F["Supported assurance conclusion"]
Common ESG Metric Types
ESG metrics vary widely, but the verification questions are consistent: What is the metric? What does it include? What does it exclude? What evidence supports it?
Metric type
Examples
Verification focus
Environmental
Scope 1 emissions, Scope 2 emissions, water usage, waste diverted, energy intensity
HR records, incident logs, policy definitions, population completeness, privacy limits
Governance
Board independence, ethics training completion, hotline cases, anti-corruption controls
Governance records, training systems, case logs, policy evidence, approval trails
Environmental measures often involve calculations and conversion factors. Social measures often involve population completeness and definition consistency. Governance measures often involve whether policies, oversight, and evidence support the reported claim.
Verification Procedures
Verification procedures should respond to the metric’s risk. A high-volume, estimate-based emissions metric requires different work than a board composition disclosure.
Common procedures include:
Inspection: compare reported data to source records such as utility bills, fuel invoices, HR reports, training logs, safety records, or board minutes.
Recalculation: recompute metrics using management’s source data, formulas, and approved conversion factors.
Sampling: select transactions, facilities, months, employees, suppliers, or incidents for detailed testing.
Observation or site visit: observe processes, meters, waste handling, safety conditions, or facility controls.
Inquiry: ask responsible personnel about collection methods, review controls, unusual changes, and limitations.
Analytical procedures: compare current-period metrics to prior periods, production levels, headcount, facility changes, or industry expectations.
Control testing: evaluate whether data inputs, approvals, system access, and change controls prevent or detect errors.
No single procedure proves every metric. The practitioner should combine procedures to address completeness, accuracy, occurrence, cutoff, classification, and presentation risks.
Recalculation and Emissions Metrics
Many emissions metrics are calculated rather than directly observed. A simplified emissions calculation is:
Activity data may be gallons of fuel, kilowatt-hours, miles traveled, tons of material processed, or other measurable activity. The emission factor converts the activity into an emissions estimate under the stated criteria.
When recalculating emissions, the practitioner should evaluate:
whether activity data is complete for the reporting boundary
whether the correct period was used
whether conversion factors are current and appropriate
whether units were converted correctly
whether estimates and assumptions are documented
whether manual adjustments were reviewed and approved
The exam trap is assuming that a precise-looking emissions number is reliable. A calculated metric can be wrong if the activity data is incomplete, the factor is outdated, the units are inconsistent, or the reporting boundary excludes relevant facilities.
Scope 1, Scope 2, and Scope 3
Greenhouse gas reporting commonly distinguishes Scope 1, Scope 2, and Scope 3 emissions.
Scope
Meaning
Evidence challenge
Scope 1
Direct emissions from owned or controlled sources
Completeness of fuel, equipment, vehicle, and facility data
Scope 2
Indirect emissions from purchased electricity, steam, heating, or cooling
Completeness of utility data and correct location- or market-based factors
Scope 3
Other indirect value-chain emissions
Reliance on suppliers, estimates, models, and external data
Scope 3 is often the hardest to verify because it may depend on supplier data, spend-based estimates, product-use assumptions, transportation data, or third-party models. A practitioner should consider whether the disclosure explains methodology and limitations clearly enough for users.
Data Controls and Completeness
Controls over ESG data are often less mature than controls over financial reporting. Data may be collected by operations, facilities, human resources, legal, procurement, or sustainability teams before finance reviews it.
Useful control questions include:
Is the metric owner identified?
Is the reporting boundary documented?
Are source systems complete and access-controlled?
Are manual spreadsheets reviewed?
Are formula changes approved?
Are unusual period-over-period changes investigated?
Are third-party data submissions validated?
Are assumptions and estimates retained in workpapers?
Completeness is frequently the hardest assertion. The practitioner may need to reconcile facilities, employees, vehicles, suppliers, or meters to a master population before testing the metric.
Benchmarking and Reasonableness
Benchmarking does not replace source testing, but it can identify unexpected relationships. For example, emissions may be compared with production volume, energy expense, facility square footage, headcount, mileage, or prior-year activity.
Unexpected changes require follow-up. A decrease in emissions may be valid because a facility closed, renewable energy was purchased, or production changed. It may also signal missing data, a changed boundary, or an incorrect conversion factor.
Common Pitfalls
Testing only the calculation without testing source-data completeness.
Accepting supplier-provided Scope 3 data without evaluating reliability.
Ignoring unit conversions and emission-factor versions.
Treating software output as evidence without testing inputs and controls.
Failing to distinguish measured data from estimated data.
Reporting benchmarking results as proof instead of a reasonableness check.
Quick Review
ESG metric verification is evidence work. The practitioner identifies the metric, criteria, and boundary; evaluates source data and controls; performs procedures such as inspection, recalculation, sampling, observation, inquiry, and analytics; then considers whether exceptions affect the assurance conclusion.
Review Questions
### Which procedure best describes recalculation of an emissions metric?
- [ ] Asking management whether the metric looks reasonable.
- [x] Recomputing emissions from activity data and the applicable emission factor.
- [ ] Reading a sustainability report for grammar.
- [ ] Comparing the company's logo to peers.
> **Explanation:** Recalculation independently recomputes the metric using source data and the relevant formula or factor.
### What is the formula relationship commonly used for emissions estimates?
- [x] Emissions equal activity data multiplied by an emission factor.
- [ ] Emissions equal net income divided by market capitalization.
- [ ] Emissions equal headcount multiplied by revenue.
- [ ] Emissions equal assets minus liabilities.
> **Explanation:** Many emissions measures convert activity data into emissions using emission factors.
### Which item is normally a Scope 1 emissions source?
- [ ] Purchased electricity.
- [x] Fuel burned in company-owned equipment.
- [ ] Supplier manufacturing emissions.
- [ ] Customer product use.
> **Explanation:** Scope 1 covers direct emissions from owned or controlled sources.
### Why is Scope 3 often harder to verify than Scope 1?
- [ ] Scope 3 is always directly metered by the company.
- [ ] Scope 3 never uses estimates.
- [x] Scope 3 often depends on suppliers, models, assumptions, and external data.
- [ ] Scope 3 is limited to board minutes.
> **Explanation:** Value-chain emissions often involve third-party data and estimation uncertainty.
### What is a common completeness procedure for ESG metrics?
- [ ] Test only the final spreadsheet total.
- [x] Reconcile facilities, meters, employees, vehicles, or suppliers to a master population.
- [ ] Ignore excluded locations.
- [ ] Use only management's verbal explanation.
> **Explanation:** Completeness testing often starts by validating the population included in the metric.
### What is the main limitation of benchmarking?
- [ ] It is never useful.
- [x] It identifies anomalies but does not replace direct source testing.
- [ ] It proves all reported metrics are accurate.
- [ ] It can only be used for governance metrics.
> **Explanation:** Benchmarking is a reasonableness check that may identify items requiring further work.
### Which evidence is most relevant to employee turnover verification?
- [ ] Utility bills.
- [ ] Fuel emission factors.
- [x] HR records showing hires, terminations, and employee population.
- [ ] Board independence charters only.
> **Explanation:** Employee turnover should be supported by HR source records and population data.
### What should the practitioner do when ESG software produces a final metric?
- [ ] Accept the output without testing.
- [x] Evaluate inputs, formulas, access controls, changes, and review procedures.
- [ ] Treat the software vendor as the assurance provider.
- [ ] Exclude the metric from all documentation.
> **Explanation:** Software output depends on data inputs, configuration, controls, and review.
### A sharp decrease in emissions compared with production volume should lead the practitioner to:
- [ ] assume management achieved the target.
- [x] investigate whether the change is supported by valid operational changes or indicates missing data or formula issues.
- [ ] delete the comparison from the workpapers.
- [ ] switch from reasonable assurance to no assurance automatically.
> **Explanation:** Unexpected relationships require follow-up and evidence.
### ESG metric verification may require inspection, recalculation, sampling, observation, inquiry, analytics, and control testing.
- [x] True.
- [ ] False.
> **Explanation:** ESG assurance uses familiar evidence procedures adapted to nonfinancial metrics and criteria.