The NIST AI RMF is structured around four functions: Govern, Map, Measure, and Manage.

  • Govern establishes the standing infrastructure — the culture, policies, processes, and accountability structures — that AI risk management depends on.
  • Map establishes context, categorises the AI system, identifies risks and benefits, and characterises impacts.
  • Measure analyses, assesses, benchmarks, and monitors AI risks through quantitative and qualitative methods, including evaluation against the seven trustworthy AI characteristics.
  • Manage prioritises risks, allocates resources to address them, and responds to incidents.

Each function contains categories (numbered, for example, Govern 1 through Govern 6) and subcategories (Govern 1.1, Govern 1.2, and so on) that specify the outcomes the organisation seeks to achieve.

“The AI RMF Core is composed of four functions: GOVERN, MAP, MEASURE, and MANAGE. Each of these high-level functions is broken down into categories and subcategories. Categories and subcategories are subdivided into specific actions and outcomes.” — NIST AI RMF 1.0, Part 2

Key definitions

TermMeaning
FunctionOne of the four top-level structural elements of the framework: Govern, Map, Measure, Manage
CategoryA grouping of related outcomes within a function. Categories are numbered within each function (e.g. Govern 1, Govern 2).
SubcategoryA specific outcome the organisation should achieve within a category — the framework’s most granular level (e.g. Govern 1.1, Manage 4.3)
OutcomeWhat the organisation should achieve at a given subcategory — stated as a result, not as a prescribed activity
Suggested actionA concrete activity proposed in the Playbook to achieve a subcategory outcome; voluntary, not required
Iterative functionA function performed continuously rather than once, with each iteration informed by outputs from the others

How NIST AI RMF functions relate

The four functions are not performed in a single linear sequence. Govern provides standing infrastructure that operates continuously. Map, Measure, and Manage operate continuously as systems and contexts evolve, with each iteration informed by outputs from the others.

FunctionPosition in the cycleRelationship to other functions
GovernStanding infrastructureUnderpins Map, Measure, and Manage; provides accountability, policies, processes within which the other three operate
MapSituational analysisEstablishes context that Measure and Manage depend on
MeasureEvidence generationProduces analysis and monitoring outputs that Manage acts on; feeds back into Map as context evolves
ManageDecision and actionActs on outputs from Map and Measure; produces feedback that informs future iterations of all three

The framework places Govern first because it provides the structures within which the other three are performed. Map second because it establishes context. Measure third because it requires context to be measured. Manage fourth because it requires measurement to act on. But the framework is explicit that the cycle is iterative — all four functions are performed continuously, not in a single pass.

Function 1: Govern

Govern establishes the culture, policies, processes, accountability structures, and oversight mechanisms that enable an organisation to manage AI risks across the AI lifecycle.

Govern categories

CategoryFocus
Govern 1Policies, processes, procedures, and practices for AI risk management
Govern 2Accountability structures — empowered, responsible, and trained teams
Govern 3Workforce diversity, equity, inclusion, and accessibility
Govern 4Organisational culture that considers and communicates AI risk
Govern 5Processes for engagement with relevant AI actors
Govern 6Policies and procedures for third-party software, data, and supply chain

What Govern produces

Four operational deliverables emerge:

  • AI risk management policies. Documented policies covering AI risk tolerance, processes, and connection to broader organisational risk management. Reviewed periodically as the AI portfolio, regulatory environment, and risk landscape change.
  • Accountability assignments. Named ownership across the lifecycle — risk identification, assessment, mitigation, monitoring, incident response. Authority must match responsibility.
  • Workforce and culture practices. Training, diversity, equity, inclusion, and accessibility considerations integrated into AI development and deployment teams. The framework treats workforce diversity as a substantive risk management measure.
  • Third-party governance. Processes for managing AI risks introduced through third-party software, models, data, and services. Due diligence, contractual provisions, ongoing monitoring, incident communication.

Why Govern matters

Govern is the function that determines whether AI risk management is an organisational capability or a project. The other three functions can be performed competently in isolation and still produce no durable change if Govern is absent. The framework places Govern first because it provides the structures within which Map, Measure, and Manage are continuously performed.

Three failure modes recur:

  • Govern as policy without process. Policies and risk management statements produced without the underlying processes, accountability structures, and review cycles that make them operative.
  • Accountability without authority. Roles assigned without the decision rights, budget, or escalation paths to perform the role. Govern 2 requires authority to match responsibility.
  • Third-party AI treated as procurement, not risk management. Foundation models, AI-enabled SaaS, and AI components from suppliers handled exclusively through ordinary vendor management. Govern 6 requires these to be managed as AI risks.

Function 2: Map

Map establishes the situational understanding on which Measure and Manage depend — what the AI system is, what it does, where it operates, who is affected, what could go wrong, and what risks are worth attending to.

Map categories

CategoryFocus
Map 1Context — intended purposes, deployment settings, users, expectations, potential impacts
Map 2Categorisation of the AI system — capabilities, targeted usage, goals, expected benefits and costs
Map 3AI capabilities, targeted usage, goals, benefits, and costs relative to benchmarks
Map 4Risks and benefits across all components including third-party software and data
Map 5Impacts on individuals, groups, communities, organisations, and society

What Map produces

Five operational deliverables emerge:

  • System context documentation. Intended purpose, deployment setting, user populations, expected interaction patterns, operational environment, and the broader sociotechnical context.
  • System categorisation. Type of AI, task domain, risk-relevant attributes (autonomy, adaptiveness, opacity), benchmarks against which performance and risk are assessed.
  • Capability and goal articulation. What the system is intended to do, the goals it serves, expected benefits, expected costs — including costs to those outside the organisation.
  • Component-level risk and benefit mapping. Risks and benefits attributable to specific components including third-party models, datasets, libraries, and infrastructure.
  • Impact characterisation across five levels. Effects on individuals, groups, communities, organisations, and society. The five-level scope is the framework’s most distinctive feature in Map.

Why Map matters

Map is the function that determines whether subsequent risk management is grounded in reality. Measurements without context produce numbers without meaning; management without understanding produces actions without targets.

Three failure modes recur:

  • Categorisation by technology rather than by function. Systems categorised as “machine learning model” or “large language model” without articulating task, deployment context, and user population.
  • Impact analysis truncated to individual level. Map 5 requires impact analysis across five levels; analyses that stop at individual harms miss the group, community, and societal effects.
  • Third-party components mapped at vendor level rather than component level. Risks mapped as supplier risks rather than as risks attributable to specific model behaviours, training data limitations, or capability boundaries.

Function 3: Measure

Measure analyses, assesses, benchmarks, and monitors AI risks through quantitative and qualitative methods. Where Map identifies what could go wrong, Measure analyses how likely and how severe, and tracks risks over time.

Measure categories

CategoryFocus
Measure 1Appropriate methods and metrics for the AI system’s intended purpose and deployment setting
Measure 2Evaluation of AI systems for trustworthy characteristics
Measure 3Mechanisms for tracking identified AI risks over time
Measure 4Feedback about efficacy of measurement

The seven trustworthy AI characteristics under Measure 2

The framework’s substantive content is concentrated in Measure 2. Each characteristic requires its own measurement approach.

CharacteristicMeasurement focus
Valid and reliablePerformance accuracy, robustness across conditions, reliability over time
SafeHazard identification, fail-safe behaviour, harm avoidance
Secure and resilientAdversarial robustness, attack resistance, recovery capability
Accountable and transparentDocumentation completeness, decision traceability, audit support
Explainable and interpretableMethod-appropriate explanation, stakeholder-appropriate interpretation
Privacy-enhancedPrivacy-preserving techniques, privacy impact, data minimisation
Fair with harmful bias managedBias measurement across groups, mitigation effectiveness, fairness across deployment contexts

The characteristics interact. Optimising for one may reduce performance against another. Trade-offs must be made consciously and documented.

What Measure produces

Five operational deliverables emerge:

  • Measurement methodology. Methods and metrics appropriate to the AI system’s intended purpose, deployment setting, and risk profile.
  • Trustworthy characteristic evaluations. Documented evaluations against each of the seven characteristics where applicable.
  • Risk tracking mechanisms. Standing processes for monitoring identified risks — drift detection, performance monitoring, incident tracking, reassessment triggers.
  • Impact measurement. Quantification and tracking of impacts characterised in Map.
  • Feedback on measurement efficacy. Documented assessment of whether the chosen measurement methods are producing useful information.

Why Measure matters

Measure is the function that produces the evidence on which Manage decisions are based. Without Measure, AI risk management operates on assertion rather than observation.

Three failure modes recur:

  • Measurement of characteristics not relevant to the system. The full set of seven characteristics applied to every system regardless of relevance.
  • Pre-deployment measurement without operational measurement. Systems evaluated thoroughly before deployment and monitored superficially after.
  • Fairness measurement at aggregate level only. Bias and fairness measured across the user population without disaggregation by the groups most likely to experience disparate effects.

Function 4: Manage

Manage allocates resources to address identified risks and responds when things go wrong. It converts the framework’s analytical work into operational decisions and interventions.

Manage categories

CategoryFocus
Manage 1AI risks prioritised, responded to, and managed
Manage 2Strategies to maximise AI benefits and minimise negative impacts
Manage 3AI risks and benefits from third-party entities managed
Manage 4Risk treatments documented, monitored, and improved

What Manage produces

Five operational deliverables emerge:

  • Risk prioritisation and response decisions. Identified risks ranked by impact, likelihood, organisational risk tolerance, and resource constraints. Prioritisation explicit and documented.
  • Benefit maximisation and impact minimisation strategies. Documented strategies that weigh AI benefits against negative impacts and articulate how the balance is maintained operationally.
  • Third-party risk treatments. Risk controls applied to AI components, models, data, and services sourced from third parties — ongoing monitoring, contractual enforcement, incident communication.
  • Response and recovery procedures. Documented procedures for responding to AI incidents, recovering from failures, and communicating with affected parties.
  • Continuous improvement of risk treatments. Treatments monitored for effectiveness, updated as the system and context change, retired when no longer relevant.

Why Manage matters

Manage is the function that determines whether AI risk management changes what the organisation does. The other three functions can produce structures, context, and evidence without altering operational decisions.

Three failure modes recur:

  • Prioritisation by visibility rather than by analysis. Risks that attract leadership attention or external scrutiny addressed; risks identified through systematic analysis but lacking visibility languish.
  • Benefit articulation absent. Risks documented extensively without articulating the benefits the AI system is intended to produce. Manage 2 expects benefit-cost balancing to be explicit.
  • Treatments implemented and not reviewed. Risk treatments applied at deployment and assumed to remain effective. Manage 4 requires monitoring of treatment effectiveness, updating as conditions change, and retirement when treatments become obsolete.

How the four functions interact in practice

A working implementation produces a continuous flow of activity across the four functions. The table below illustrates how a single AI system moves through the cycle.

StageGovern roleMap roleMeasure roleManage role
System inceptionPolicies and accountability already in placeContext, categorisation, initial risk identification, impact characterisationMethodology selection; initial benchmarkingInitial risk prioritisation; resource allocation
System developmentLifecycle policies governing developmentRefinement of context and risk identification as design choices firm upPre-deployment evaluation against trustworthy characteristicsRisk treatment decisions; mitigations applied
System deploymentDeployment policies and accountabilityDeployment context analysis; final impact characterisationValidation against deployment context; baseline metricsDeployment risk treatments; response procedures activated
Operation and monitoringOngoing governance; engagement with affected partiesReassessment as deployment context evolvesOngoing monitoring; drift detection; incident trackingResponse to detected issues; treatment effectiveness review
Incident or changeIncident escalation through governance channelsReassessment of risks and impactsInvestigation and measurement of the incidentResponse, communication, recovery; corrective treatments

The cycle never closes definitively for a deployed AI system. Map, Measure, and Manage operate continuously, and Govern provides the standing infrastructure within which they operate. Static implementations — those that perform the four functions once at deployment and never revisit them — lose value as the system and its context evolve.

How NIST AI RMF functions integrate with other governance instruments

For organisations operating multiple frameworks, the four functions integrate with adjacent governance instruments in defined ways.

InstrumentHow the four functions integrate
ISO/IEC 42001Govern aligns with Clauses 4 and 5; Map and Measure align with Clause 6 and Annex A.5–A.7; Manage aligns with Clause 8 and Clause 10
EU AI ActMap and Measure support Articles 9, 10, 14; Manage supports Articles 72 and 73; Govern supports Article 17 quality management
ISO/IEC 23894Map and Measure align with the AI risk management methodology of ISO/IEC 23894
Sector frameworksProfiles adapt the four functions to specific sectors — healthcare, financial services, public safety

The four functions provide the structural backbone within which other instruments’ substantive content operates. Organisations using multiple frameworks typically anchor implementation on the four functions and map other instruments’ requirements to them.

NIST AI RMF resource references

The primary references for working with the four functions:

ResourceSource
NIST AI RMF 1.0https://www.nist.gov/itl/airc — the core framework defining the four functions
AI RMF Playbookhttps://airc.nist.gov/airmf-resources/playbook/ — suggested actions for each subcategory across all four functions
NIST AI 600-1: Generative AI Profilehttps://www.nist.gov/itl/airc — generative AI risks mapped to the four functions
AI RMF Crosswalkshttps://www.nist.gov/itl/airc — mappings of the four functions to ISO/IEC 42001, EU AI Act, OECD principles, others
AI Resource Center (AIRC)https://airc.nist.gov — central hub for AI RMF resources

The Playbook is the most directly useful reference for operational work on the four functions. For each subcategory across Govern, Map, Measure, and Manage, it provides suggested actions, references, documentation considerations, and transparency considerations.

FAQ

Are NIST AI RMF four functions performed in order?

Not strictly. The framework places them in a logical sequence — Govern first, then Map, Measure, Manage — but in operation the four functions run continuously and iteratively. Govern is standing infrastructure; Map, Measure, and Manage are repeated as systems and contexts evolve. Each iteration of any function informs the others.

Can I focus on one or two functions and defer the others?

The framework expects all four functions to be operative for the AI systems within scope. Focusing on Measure without Govern produces measurement without accountability; focusing on Manage without Map produces decisions without context. Partial implementations are sometimes pragmatic during early adoption, but they should be understood as transitional rather than as steady states.

Which NIST AI RMF function is most demanding?

There is no single answer. Govern is most demanding when starting from no existing governance infrastructure — the policies, accountability structures, and processes are new work. Measure is most demanding for organisations with complex AI portfolios — the substantive evaluation methodology depth is significant. Map is most demanding for organisations with novel use cases — the context analysis cannot draw on standard templates. Manage is most demanding for organisations with mature governance and complex risks — the decisions and trade-offs require established analytical foundations.

How are NIST AI RMF categories and subcategories used in practice?

Categories group related outcomes within a function and provide a high-level structure for adoption planning. Subcategories specify the outcomes themselves and are the level at which the Playbook provides suggested actions. Organisations typically adopt the framework subcategory by subcategory, using the Playbook to translate each outcome into operational activities.

Does NIST AI RMF framework specify how to perform each function?

The framework specifies outcomes, not methods. It states what should be achieved under each subcategory but does not prescribe how. The Playbook provides suggested actions; profiles provide context-specific adaptations; external resources (NIST publications, academic literature, sector guidance) provide methodology. The framework treats methodology as the organisation’s responsibility within its context.

How does the Govern function relate to existing organisational governance?

Govern is designed to integrate with existing organisational governance — enterprise risk management, board oversight, executive accountability, compliance — rather than to replace it. The framework expects AI risk management to operate within the broader governance infrastructure, with AI-specific elements added where needed. Organisations with mature general governance typically find Govern less demanding than organisations starting from limited governance maturity.

Why are there five categories in Map but only four in Measure and Manage?

The category counts reflect the substantive territory each function covers. Map’s five categories cover context, categorisation, capability articulation, risk and benefit mapping, and impact characterisation — five distinct activities. Measure’s four categories cover method selection, characteristic evaluation, risk tracking, and measurement efficacy. Manage’s four cover prioritisation, benefit-impact balancing, third-party management, and treatment improvement. The counts are not significant in themselves; what matters is the substantive coverage.

What is the relationship between the NIST AI RMF functions and the seven trustworthy AI characteristics?

The characteristics are the substantive goals of AI risk management — what the four functions are designed to achieve in operation. Measure 2 is the most direct point of integration, where the characteristics are evaluated explicitly. But the characteristics also inform Map (impact characterisation), Govern (policy framing), and Manage (treatment selection). The framework expects the characteristics to be the working vocabulary across all four functions.

How long does it take to operationalise all NIST AI RMF functions?

Adoption is gradual. Organisations typically work through the four functions over six to eighteen months, with Govern foundational activity preceding substantive Map, Measure, and Manage work on specific AI systems. Full operation across all functions and all AI systems takes longer for organisations with large AI portfolios. Because the framework is voluntary and non-certifiable, there is no equivalent to the audit gates that structure ISO certification timelines.

Can I adapt the NIST AI RMF functions for sector-specific use?

Yes. The profile mechanism is designed for this. A profile adapts the four functions to a specific use case, sector, or technology — identifying which subcategories apply most heavily, what risks are distinctive, and what suggested actions are appropriate. The Generative AI Profile (NIST AI 600-1) is an example. Organisations operating in contexts without a published profile commonly develop internal profiles describing their adaptation.

Back to Blog