TDX/AI
Rapid Research

Opportunity Analysis

Why now — the UN Decade, AI readiness, and comparable global systems.

Opportunity Analysis: A Global Intelligence System for Transport

Executive Summary

A unique window of opportunity exists to build a Global Intelligence System for Transport -- an integrated data platform with AI-powered querying capabilities (a "Transport Angel" chatbot) -- positioned as a foundational infrastructure of the UN Decade for Sustainable Transport (2026-2035).

The convergence of five factors makes this the right initiative at the right time:

  1. Institutional momentum: The UN Decade formally begins in 2026, with explicit mandates for improved transport data
  2. Technology readiness: Large language models and geospatial AI have matured to the point where natural language querying of complex datasets is technically feasible
  3. Coalition of the willing: Key organisations (FCDO, WRI, GIZ, TRL, University of Oxford, UNDESA, UNECE) are aligned around the need
  4. Climate urgency: Transport accounts for ~24% of global CO2 emissions, and the sector cannot decarbonise without better data
  5. Equity imperative: Developing countries face the largest transport data gaps precisely where evidence-based planning is most needed

This document synthesises research across four dimensions -- the UN Decade context, AI/LLM capabilities, current data gaps, and comparable global systems -- to make the case for action.


Table of Contents

  1. The Problem: Why Global Transport Data is Broken
  2. The Opportunity: Why Now
  3. The Vision: What a Global Intelligence System Could Be
  4. The Transport Angel: AI-Powered Access to Transport Knowledge
  5. The UN Decade as an Enabling Framework
  6. Learning from Other Sectors
  7. The Coalition: Partners and Roles
  8. Risks and Challenges
  9. Phased Roadmap
  10. Conclusion and Call to Action

Companion Documents:


1. The Problem: Why Global Transport Data is Broken

1.1 Fragmentation Across Modes, Geographies, and Institutions

Global transport data is scattered across hundreds of disconnected systems:

  • By mode: Road safety data (WHO), aviation (ICAO), maritime (IMO), railways (UIC), urban transit (UITP) -- each with different standards, coverage, and access models
  • By geography: OECD countries have reasonable data through ITF; the rest of the world has patchy coverage at best
  • By institution: World Bank, regional development banks, national statistics offices, city transport authorities, and private operators each hold fragments
  • By format: Static PDFs, proprietary databases, GIS shapefiles, GTFS feeds, APIs, and paper records

There is no single place where a policymaker, researcher, or planner can get a comprehensive view of the global transport system.

1.2 The Developing Country Data Desert

The data gap is most severe where it matters most:

  • Sub-Saharan Africa: Only 17 of 54 countries have published GTFS data for any city. Road crash data is estimated rather than counted in most countries.
  • South Asia: Despite massive transport challenges, standardised data is limited to a few major cities.
  • Small Island Developing States: Transport data is almost entirely absent, despite extreme vulnerability to climate impacts on transport infrastructure.
  • Landlocked Developing Countries: Cross-border transport data is critical but virtually non-existent in standardised form.

This is not merely a technical gap -- it is an equity gap. Countries with the least data have the least capacity to make evidence-based transport decisions, leading to worse outcomes for their populations.

1.3 The Standardisation Challenge

Even where data exists, it does not interoperate:

DomainStandardsStatus
Public transitGTFS, NeTEx, SIRI, TransXChangeMultiple competing standards, partial adoption
Road trafficDATEX II, TMC, TPEGRegional, not global
Road safetyIRTAD, WHO methodologyDifferent classification systems
Freight/logisticsUN/EDIFACT, CEN standardsFragmented by mode and region
InfrastructureVarious national asset management systemsNo global standard
EmissionsIPCC guidelines, national inventory formatsTransport often poorly disaggregated
Urban mobilityMDS, GBFS, CDS-MEmerging, limited adoption

The result: even when two countries collect similar data, it often cannot be directly compared.

1.4 The Static Data Problem

Much of the available global transport data is:

  • Years out of date: Census-based transport surveys may be 5-10 years old
  • Snapshot, not time series: One-off studies rather than continuous monitoring
  • Aggregate, not granular: National-level averages masking subnational variation
  • Supply-side, not demand-side: Infrastructure inventories without usage/demand data

Meanwhile, the transport sector is transforming rapidly -- electrification, shared mobility, autonomous vehicles, post-pandemic travel pattern shifts -- making historical data less useful for current planning.

1.5 The Consequences

Without adequate data, the world cannot:

  • Monitor SDG progress: Transport-related SDG indicators (11.2, 3.6, 9.1) are among the most data-poor
  • Track decarbonisation: Transport NDC commitments cannot be verified without baseline and monitoring data
  • Allocate investment efficiently: Development finance for transport (~$300 billion/year) is guided by incomplete evidence
  • Save lives: Road safety interventions cannot be targeted without crash data
  • Plan equitably: The needs of women, disabled people, and rural populations are invisible without disaggregated data

2. The Opportunity: Why Now

2.1 The UN Decade Creates Institutional Demand

The proclamation of 2026-2035 as the UN Decade for Sustainable Transport (see deep dive) creates an unprecedented decade-long institutional demand signal for transport data and intelligence. Specifically:

  • General Assembly Resolution A/RES/79/225 calls for improved data collection and sharing
  • The Beijing Statement (2024) called for a Global Transport Atlas
  • UNDESA and UNECE are tasked with coordinating the Decade and need data infrastructure
  • The Decade will require monitoring frameworks that demand better data

A Global Intelligence System positioned as "the data backbone of the Decade" would have political legitimacy, institutional demand, and a clear mandate.

2.2 AI/LLM Technology Has Reached a Tipping Point

The technology required to build the Transport Angel was not feasible three years ago. Key breakthroughs (detailed in AI applications analysis):

  • Large Language Models (GPT-4, Claude, Gemini, open-source alternatives) can now reliably translate natural language questions into structured queries
  • Retrieval-Augmented Generation (RAG) enables accurate, cited answers from document corpora
  • Geospatial AI can derive transport data (road networks, traffic patterns) from satellite imagery, filling gaps where no ground-based data exists
  • Multilingual capabilities mean the system can serve users in multiple UN languages from day one
  • Costs are falling rapidly: What cost $100 per query in 2023 costs under $1 in 2026

The window is open: the technology is mature enough to be reliable but early enough that the transport sector can be a leader rather than a follower.

2.3 The Open Data Movement is Building Momentum

  • GTFS adoption has expanded from a handful of cities to thousands globally
  • OpenStreetMap transport data coverage continues to improve, especially through organised mapping efforts in developing countries
  • Open transit data mandates are spreading (EU, US, and increasingly in Latin America and Asia)
  • Development organisations (World Bank, GIZ, FCDO) are increasingly requiring open data in funded projects
  • SharedStreets, OpenTraffic, and similar initiatives have demonstrated that open transport data can achieve global scale

2.4 Climate Urgency Demands Better Transport Data

  • Transport is responsible for ~24% of energy-related CO2 emissions and rising
  • The gap between Paris Agreement goals and transport sector emissions is widening
  • Enhanced NDCs increasingly include transport targets, but monitoring systems are absent
  • Climate finance for transport requires rigorous baseline and MRV (Measurement, Reporting, Verification) data
  • Climate TRACE has shown that AI-derived emissions monitoring is possible for transport

2.5 The SDG Countdown Creates Political Pressure

With 2030 approaching:

  • SDG 11.2 (access to public transport) remains one of the least measured indicators
  • SDG 3.6 (road traffic deaths) has data but shows insufficient progress
  • SDG 9.1 (quality infrastructure) lacks transport-specific measurement frameworks
  • There will be intense political pressure to demonstrate progress or explain failure -- both require data

3. The Vision: What a Global Intelligence System Could Be

3.1 Core Concept

A federated data platform that:

  • Aggregates transport data from diverse global sources (national statistics, open data, satellite-derived, crowdsourced)
  • Harmonises data to common standards and taxonomies
  • Fills gaps using AI, satellite imagery, and statistical modelling
  • Provides access through multiple channels: API, web portal, and AI chatbot (Transport Angel)
  • Enables analysis for diverse users from UN analysts to city planners to researchers

3.2 What It Is NOT

  • Not a replacement for national transport data systems -- it is a layer above them
  • Not a real-time control system -- it is an intelligence and planning tool
  • Not exclusively a technology project -- it is equally about governance, standards, and capacity building
  • Not a one-off product -- it is sustained infrastructure for the Decade and beyond

3.3 Design Principles

Drawing on lessons from comparable systems (see precedent analysis):

PrincipleRationale
Federated, not centralisedData stays with owners; the platform provides discovery, access, and harmonisation
Open by defaultFree access maximises impact and adoption; closed data by exception only
Standards-basedBuild on existing transport data standards (GTFS, NeTEx, etc.) and extend where needed
AI-nativeAI/LLM capabilities designed in from the start, not bolted on later
Equity-firstPrioritise filling data gaps in underserved regions, not just aggregating existing rich data
MultilingualSupport UN working languages at minimum
TrustworthyEvery AI-generated insight traceable to source data with confidence levels
SustainableGovernance, funding, and technology choices that can endure beyond any single grant cycle

3.4 Illustrative User Journeys

Journey 1: The Transport Minister in an LDC

"How does our country's road safety performance compare to our regional peers, and what interventions have been most effective in similar contexts?"

The system retrieves comparative road safety data, adjusting for data quality differences, identifies peer countries, searches the evidence base for effective interventions in comparable contexts, and presents a synthesised answer with citations to source data and research.

Journey 2: The UNDESA Analyst

"Give me a dashboard of Decade progress indicators for all 193 member states, highlighting where data is missing."

The system queries across multiple datasets, compiles a multi-indicator dashboard, identifies data gaps, and flags where AI-derived estimates could fill gaps with appropriate confidence levels.

Journey 3: The Climate Negotiator

"Summarise transport decarbonisation commitments in the latest round of NDCs and compare to the trajectories needed for 1.5 degrees."

The system searches the NDC registry, extracts transport-specific commitments using NLP, compares them to science-based trajectories from IEA and IPCC, and identifies the gap.

Journey 4: The City Planner in Bogota

"Show me cities of similar size and income level that have implemented BRT systems. What ridership have they achieved and what were the key design features?"

The system searches project databases, matches cities by characteristics, retrieves BRT performance data, and synthesises lessons learned.


4. The Transport Angel: AI-Powered Access to Transport Knowledge

4.1 Concept

The "Transport Angel" is an LLM-powered chatbot that serves as the primary interface for the Global Intelligence System. It makes the world's transport data and knowledge accessible through natural language conversation.

4.2 Core Capabilities

  1. Natural language querying of structured data: Users ask questions; the system queries databases and returns answers with visualisations
  2. Document and knowledge synthesis: RAG-based querying across transport reports, policies, standards, and research
  3. Geospatial intelligence: "Show me..." queries that return maps and spatial analysis
  4. Comparative analysis: Cross-country, cross-city, cross-mode comparisons on demand
  5. Trend and gap identification: Proactive identification of data gaps, emerging trends, and anomalies
  6. Multilingual interaction: Engage in all UN working languages

4.3 Why "Angel" and Not Just a Dashboard?

  • Dashboards require data literacy: Users must know what to look for. A chatbot can guide exploration.
  • Dashboards are pre-configured: They show what the designer anticipated. A chatbot can answer unanticipated questions.
  • Dashboards do not synthesise: They display data. A chatbot can integrate data with contextual knowledge.
  • Dashboards do not democratise: They serve data specialists. A chatbot serves anyone who can ask a question.
  • Dashboards still matter: The Angel should be able to generate dashboard-style visualisations on demand.

4.4 Technical Feasibility

Based on the AI landscape analysis (see full analysis):

CapabilityFeasibility (2026)Key Dependency
Text-to-SQL queryingHighWell-curated, documented database schemas
RAG over transport documentsHighComprehensive document indexing pipeline
Multilingual conversationHighModern LLMs have strong multilingual capability
Map/spatial responsesMedium-HighIntegration with mapping libraries and geospatial data
Cross-dataset synthesisMediumData harmonisation and ontology work
Real-time data integrationMediumAvailability of real-time feeds
Accuracy guaranteesMediumHallucination mitigation, citation requirements

4.5 Build vs Buy vs Hybrid

ApproachProsCons
Build on commercial LLM APIsFastest to prototype, highest capabilityVendor dependency, ongoing cost, data privacy
Build on open-source LLMsFull control, no vendor lock-inHigher upfront effort, potentially lower capability
HybridBest of both -- open-source core with commercial for advanced featuresArchitectural complexity

Recommendation: Start with commercial APIs for rapid prototyping and user testing, with a roadmap to open-source alternatives for long-term sustainability. The hybrid approach provides the best balance of speed and independence.


5. The UN Decade as an Enabling Framework

(See full analysis)

5.1 Political Legitimacy

The Decade provides:

  • A General Assembly mandate that legitimises global transport data cooperation
  • Institutional homes (UNDESA for coordination, UNECE for standards) that can anchor the system
  • A ten-year timeframe that matches the long-term investment needed for data infrastructure
  • Political attention cycles (launch, mid-term, review) that create demand for data products

5.2 Alignment with Decade Pillars

The Global Intelligence System serves all six Decade pillars:

Decade PillarHow the System Contributes
Climate/DecarbonisationEmissions monitoring, NDC tracking, scenario modelling
Safety/HealthCrash data harmonisation, predictive safety analysis
Access/EquityTransit coverage mapping, accessibility analysis
Resilience/AdaptationInfrastructure vulnerability assessment, disruption monitoring
Innovation/DigitalisationThe system itself is the innovation pillar's flagship
Governance/FinanceInvestment tracking, policy monitoring, SDG indicator dashboard

5.3 The "Data Backbone" Positioning

The strongest strategic positioning for the system is as "the data backbone of the UN Decade for Sustainable Transport". This framing:

  • Makes it essential infrastructure, not a nice-to-have project
  • Aligns all partners around a shared, apolitical goal (better data)
  • Creates demand from the Decade's own monitoring and reporting needs
  • Justifies sustained funding over the full 2026-2035 period

6. Learning from Other Sectors

(See full analysis)

6.1 Key Precedents

SystemSectorKey Lesson for Transport
WHO Global Health ObservatoryHealthInstitutional mandate + standardised classifications = authoritative platform
Copernicus Climate Data StoreClimateBring compute to data; open access at massive scale
Climate TRACEClimateAI-native architecture can leapfrog traditional data collection
CGIAR GARDIANAgricultureFederated discovery layer over distributed data
OCHA HDXHumanitarianCommunity-driven, low-barrier data sharing achieves rapid adoption
GBIFBiodiversityData standards + incentives + national nodes = global scale
Global Fishing WatchOceansSatellite + AI + transparency transforms a sector

6.2 The Transport Leapfrog Opportunity

None of the existing global data platforms have deeply integrated AI/LLM capabilities. Most were built before the current generation of LLMs existed. The Global Intelligence System for Transport has the opportunity to be the first major global sectoral data platform designed from the ground up around AI capabilities.

This is a genuine first-mover advantage: building the Transport Angel now, when AI is mature enough to be useful but before other sectors have set the pattern, could make the transport system a model for others.

Drawing on precedent analysis:

Layer 1: STANDARDS (cf. GBIF's Darwin Core)
   Transport Data Standards Framework
   Common taxonomies, ontologies, classifications
   Building on GTFS, NeTEx, DATEX II, SIRI, etc.
        |
Layer 2: DATA SHARING (cf. OCHA HDX)
   Transport Data Exchange
   Open catalogue, contribution mechanisms
   Quality tiers, metadata standards
        |
Layer 3: FEDERATED ACCESS (cf. CGIAR GARDIAN)
   Unified query across distributed sources
   API gateway, authentication, rate limiting
        |
Layer 4: ANALYTICS (cf. Copernicus CDS Toolbox)
   Cloud-based analysis environment
   Pre-built indicators, custom analysis tools
        |
Layer 5: AI GAP-FILLING (cf. Climate TRACE)
   Satellite-derived transport data
   Statistical modelling for missing data
   Confidence-scored estimates
        |
Layer 6: INTELLIGENCE INTERFACE (NOVEL)
   Transport Angel chatbot
   Natural language access to all layers
   Multilingual, multi-modal (text, maps, charts)

7. The Coalition: Partners and Roles

7.1 Current Partners and Comparative Advantages

PartnerStrengthsPotential Role
FCDOBilateral funding, policy influence, LMIC focusFunder, political champion, demand articulation from developing countries
WRIUrban transport data (NUMO, TUMI), data platforms (Resource Watch), presence in Global SouthData contribution, platform design, city engagement
GIZTechnical cooperation, TUMI initiative, bilateral programmes in 60+ countriesCapacity building, country engagement, data collection support
TRLTransport research, safety data expertise, LMIC research programmesEvidence base, quality assurance, safety data standards
University of OxfordTransport Studies Unit, data science, academic credibilityResearch, methodology, independent evaluation
UNDESADecade coordination, convening power, SDG monitoringInstitutional home, political legitimacy, UN system integration
UNECETransport standards (WP.29, WP.1), statistics, ForFITS modelStandards leadership, regulatory framework, European node

7.2 Critical Gaps in the Coalition

The current coalition is strong but has gaps that should be addressed:

GapWhy It MattersPotential Partners
Technology/AI expertiseBuilding the Transport Angel requires deep AI engineeringGoogle/DeepMind, Microsoft AI, Anthropic, or specialised AI firms
Regional representationCurrent coalition is heavily Euro-AtlanticUNECA, UNESCAP, UNECLAC, AfDB, ADB, IDB
Private sector dataMajor transport data held by private operatorsUber, Google Maps, TomTom, Moovit, transit agencies
National statistics officesOfficial statistics credibilityUNSD, national NSOs, Paris21
Geospatial/satelliteAI-derived gap-filling capabilityESA, NASA, Planet Labs, Development Seed
Open data communityStandards and crowdsourced dataOpenStreetMap, MobilityData, Open Transport Partnership

7.3 Governance Considerations

Based on precedent analysis, the governance model should include:

  • Steering Committee: Senior representatives from partner organisations, meeting quarterly
  • Technical Advisory Group: Data scientists, transport experts, AI specialists
  • User Advisory Group: Representatives from target user communities (LDC policymakers, UN analysts, researchers, city planners)
  • Secretariat: Small dedicated team housed within an appropriate institution (UNDESA, UNECE, or WRI)
  • Open governance: Transparent decision-making, public roadmap, community input mechanisms

8. Risks and Challenges

8.1 Data Risks

RiskLikelihoodImpactMitigation
Data sovereignty concernsHighHighFederated architecture (data stays with owners); clear data governance agreements; respect national ownership
Data quality issuesHighHighQuality tiers and transparency; confidence scoring; validation mechanisms
Insufficient data in LMICsHighHighAI/satellite gap-filling; capacity building investment; prioritise LMICs in early phase
Proprietary data barriersMediumHighAdvocacy for open data; partnership models; work with what is available
Data privacy violationsMediumHighPrivacy-by-design; aggregation thresholds; compliance with GDPR and national laws

8.2 Technical Risks

RiskLikelihoodImpactMitigation
AI hallucinationMediumHighStrict RAG architecture; citation requirements; human validation; confidence scoring
Vendor lock-inMediumMediumOpen-source core; multi-provider strategy; abstraction layers
Scalability challengesMediumMediumCloud-native architecture; phased rollout; performance testing
Integration complexityHighMediumStart simple (few sources), expand incrementally; well-defined APIs
Rapid technology changeHighLow-MediumModular architecture; regular technology reviews; avoid over-commitment to current tools

8.3 Institutional and Political Risks

RiskLikelihoodImpactMitigation
Partner misalignmentMediumHighClear MoU; defined roles; shared governance; regular alignment
Funding discontinuityMediumHighDiversified funding; demonstrate value early; embed in institutional budgets
Political resistanceMediumMediumTransparency about data provenance; national ownership emphasis; build trust through consultation
Scope creepHighMediumPhased roadmap with clear scope boundaries; strong product management
UN bureaucratic inertiaMediumMediumAgile delivery alongside formal UN processes; demonstrate quick wins
Competition from commercial platformsMediumLow-MediumFocus on public good mission; open access; areas where commercial sector will not serve

8.4 Sustainability Risks

RiskLikelihoodImpactMitigation
Grant dependencyHighHighMove toward institutional funding (UN regular budget, member state contributions)
Staff turnoverMediumMediumDocumentation; open-source contributions; institutional knowledge management
Technology obsolescenceMediumMediumModular, standards-based architecture; regular updates
User adoption failureMediumHighUser-centred design; early engagement; demonstrate immediate value

8.5 The Biggest Risk: Doing Nothing

The greatest risk is inaction. Without a coordinated global transport intelligence system:

  • The UN Decade will lack the data infrastructure to monitor its own progress
  • Developing countries will continue to plan transport systems without evidence
  • Climate targets for transport will remain unmonitored and unmet
  • Billions in transport investment will be guided by incomplete information
  • The opportunity to leverage AI for transport data will be captured by commercial platforms serving commercial interests, not the public good

9. Phased Roadmap

Phase 0: Foundation (2026, Months 1-6)

Goal: Establish the initiative, secure initial funding, define scope

  • Formalise the partnership through an MoU or equivalent
  • Secure seed funding (target: $2-5M for Phase 1)
  • Define the Transport Data Standards Framework (building on existing standards)
  • Conduct a comprehensive inventory of existing global transport data sources
  • Design the governance model
  • Recruit core technical team (5-10 people)
  • Deliverable: Partnership agreement, data landscape assessment, system design document

Phase 1: Proof of Concept (2026-2027, Months 6-18)

Goal: Demonstrate the core concept works

  • Build the first version of the Transport Angel chatbot
  • Start with 5-10 curated datasets (e.g., WHO road safety, ITF statistics, GTFS aggregation, Climate TRACE transport emissions)
  • Index a corpus of 500+ transport policy documents for RAG
  • Deploy for a limited user group (50-100 UN analysts, researchers, policymakers)
  • Gather feedback and iterate
  • Deliverable: Working prototype with limited but real data, user feedback report

Phase 2: Expansion (2027-2028, Months 18-36)

Goal: Scale data coverage and user base

  • Expand to 50+ data sources
  • Add AI-derived data (satellite-based road network extraction, traffic estimation)
  • Launch the federated data exchange (HDX-style contribution mechanism)
  • Open to 1,000+ users across all partner organisations and developing country governments
  • Add multilingual support (all UN working languages)
  • Develop the Transport Analysis Workbench (cloud-based analysis tools)
  • Deliverable: Production platform with broad data coverage, active user community

Phase 3: Maturation (2028-2030, Months 36-60)

Goal: Achieve authoritative status for SDG 2030 review

  • Comprehensive global coverage across all transport modes
  • Become the reference platform for SDG transport indicator monitoring
  • Deep integration with national transport data systems
  • Advanced AI capabilities (predictive analysis, scenario modelling, automated reporting)
  • Self-sustaining governance and funding model
  • Deliverable: Authoritative global transport intelligence platform, SDG 2030 assessment contribution

Phase 4: Consolidation (2030-2035)

Goal: Sustain and evolve through the second half of the Decade

  • Continuous improvement based on technology evolution and user needs
  • Comprehensive impact assessment for the Decade's final review
  • Transition planning for post-Decade sustainability
  • Knowledge transfer and capacity building for national systems
  • Deliverable: Decade impact assessment, sustainable institutional framework

10. Conclusion and Call to Action

The Case in Brief

The world needs better transport data. The UN Decade demands it. AI now makes it possible to deliver it in ways that were unimaginable five years ago. A coalition of leading organisations is ready to build it.

What Makes This Different from Past Attempts

Previous efforts at global transport data harmonisation have been:

  • Too narrow (one mode, one issue) -- this is cross-cutting
  • Too static (databases, not intelligence) -- this includes AI-powered querying
  • Too supply-driven (build it and they will come) -- this starts with user needs
  • Too centralised (one organisation controlling all data) -- this is federated
  • Too disconnected from political frameworks -- this is embedded in the UN Decade

The Ask

To move from opportunity to reality, the coalition needs:

  1. Political endorsement: Position the system within the Decade's Action Plan through UNDESA/UNECE advocacy
  2. Seed funding: $2-5M for Phase 0 and Phase 1 (2026-2027) -- this could come from FCDO, GIZ, or a combination of bilateral donors
  3. Technical commitment: Partners to allocate staff time and data access
  4. AI partnership: Engagement with one or more AI organisations for the Transport Angel
  5. User engagement: Early identification of 50-100 pilot users from diverse contexts

The Prize

If successful, the Global Intelligence System for Transport would:

  • Give every country in the world access to transport evidence, regardless of their current data capacity
  • Enable the UN Decade to actually monitor and demonstrate its impact
  • Accelerate transport decarbonisation through better data and analysis
  • Save lives through improved safety data and targeting of interventions
  • Pioneer a new model of AI-powered global public good data infrastructure
  • Demonstrate that international cooperation can harness technology for sustainable development

The window is open. The question is whether we walk through it.


Document prepared: February 2026 Research and analysis by the Transport Intelligence Initiative For questions, contact the project coordination team via the partner organisations


DocumentDescription
UN Decade Deep DiveDetailed analysis of the UN Decade for Sustainable Transport, its resolutions, pillars, and data implications
AI/LLM ApplicationsSurvey of AI and LLM use cases in transport, technical feasibility assessment, Transport Angel design considerations
Comparable SystemsAnalysis of precedent global data platforms in health, climate, agriculture, humanitarian, and other sectors

The UN Decade for Sustainable Transport (2026-2035): Deep Dive

1. Overview and Origins

The United Nations General Assembly proclaimed 2026-2035 as the United Nations Decade of Sustainable Transport through Resolution A/RES/79/225, adopted in late 2024. This followed years of advocacy, building on the outcomes of the Second UN Global Sustainable Transport Conference (Beijing, October 2024) and earlier milestones including the First Conference (Ashgabat, 2016) and decades of work by UNECE, UNCTAD, and other UN agencies.

The Decade represents the first time the international community has designated a full decade-long focus on transport, reflecting the sector's cross-cutting importance to virtually all 17 Sustainable Development Goals.

2. Key Resolutions and Mandates

General Assembly Resolution A/RES/79/225

The resolution establishing the Decade calls upon member states, international organisations, and stakeholders to:

  • Accelerate the transition to sustainable, low-carbon, resilient transport systems
  • Strengthen data collection, sharing, and analysis on transport at national and global levels
  • Promote interoperability and standardisation of transport systems and data
  • Enhance capacity building in developing countries, particularly Least Developed Countries (LDCs), Landlocked Developing Countries (LLDCs), and Small Island Developing States (SIDS)
  • Leverage digital technologies and innovation, including AI and data analytics, to improve transport planning and operations

The Beijing Statement (2024)

The outcome document from the Second Global Sustainable Transport Conference specifically highlighted:

  • The need for a global transport data framework to monitor progress
  • Calls for a Global Transport Atlas or equivalent knowledge platform
  • The importance of intermodal and cross-border data sharing
  • Recognition that data gaps in developing countries are a critical barrier to evidence-based transport policy

Earlier Foundational Resolutions

  • A/RES/72/212 (2017): Strengthening links between transport modes for SDG achievement
  • A/RES/76/294 (2022): Calling for sustainable transport integration into national development strategies
  • The New Urban Agenda (2016): Emphasising data-driven urban transport planning

3. Goals and Thematic Priorities of the Decade

The Decade is structured around several interconnected thematic pillars:

Pillar 1: Climate Action and Decarbonisation

  • Transport accounts for approximately 24% of direct CO2 emissions from fuel combustion globally
  • The sector is the fastest-growing source of emissions in many developing countries
  • Goal: Align transport systems with the Paris Agreement's 1.5 degrees C pathway

Pillar 2: Safety and Health

  • 1.35 million road traffic deaths annually (WHO Global Status Report)
  • Target aligned with the UN Decade of Action for Road Safety and the Stockholm Declaration
  • Reducing air pollution from transport, which causes an estimated 385,000 premature deaths annually

Pillar 3: Access and Equity

  • Over 1 billion people lack adequate access to transport
  • Rural accessibility, gender-responsive transport, and disability inclusion
  • Freight connectivity for landlocked developing countries

Pillar 4: Resilience and Adaptation

  • Transport infrastructure vulnerable to climate change impacts
  • Supply chain resilience demonstrated as critical during COVID-19
  • Disaster preparedness and recovery

Pillar 5: Innovation and Digitalisation

  • This is where a Global Intelligence System fits most directly
  • Harnessing AI, big data, IoT, and digital twins for transport
  • Open data standards and interoperability frameworks
  • Intelligent Transport Systems (ITS) deployment

Pillar 6: Governance and Finance

  • Mobilising investment for sustainable transport infrastructure
  • Policy coherence across transport modes and government levels
  • Monitoring frameworks and indicators

4. Institutional Architecture

Lead Agencies

  • UNDESA (UN Department of Economic and Social Affairs): Overall coordination of the Decade
  • UNECE (UN Economic Commission for Europe): Technical standards, inland transport regulations, and the WP.29/WP.1 framework
  • Regional Commissions: UNECA, UNESCAP, UNECLAC, UNESCWA each have regional transport mandates

Key Partners

  • World Bank: Transport Global Practice, major funder of transport infrastructure
  • World Resources Institute (WRI): Urban transport data, NUMO partnership
  • GIZ (Deutsche Gesellschaft fuer Internationale Zusammenarbeit): Bilateral transport programmes, TUMI initiative
  • TRL (Transport Research Laboratory): Research and evidence base, particularly for LMICs
  • ITF (International Transport Forum): Transport statistics and policy analysis at OECD
  • FCDO (UK Foreign, Commonwealth & Development Office): Bilateral funding for transport in developing countries

The Role of Academia

  • University of Oxford: Transport Studies Unit, contributions to evidence synthesis
  • Imperial College London, MIT, Tsinghua: Key research nodes
  • Demand for better data to support research and evidence-based policy

5. The Data and Evidence Dimension

What the Decade Needs from Data

The Decade's monitoring framework requires:

  1. Baseline data: Current state of transport systems in all countries
  2. Progress indicators: Tracking decarbonisation, safety improvements, access expansion
  3. Comparable metrics: Standardised definitions across countries and modes
  4. Timeliness: Annual or more frequent updates, not decade-lag census data
  5. Disaggregation: By mode, geography, gender, income group

Current Data Architecture (Gaps)

The existing global transport data landscape is fragmented:

SourceCoverageLimitations
ITF Transport StatisticsOECD + select partnersLimited developing country coverage
WHO Road Safety DataGlobal (194 countries)Safety only, biennial updates
UITP Urban Mobility DataCities, members onlyProprietary, expensive
OpenStreetMapGlobal, crowdsourcedVariable quality, no operational data
National statisticsCountry-levelNon-standardised, infrequent
IEA Transport EnergyGlobalEnergy/emissions only
UIC Railway StatisticsGlobal railRail mode only

The Case for a Global Intelligence System

The Decade cannot achieve its goals without a step change in transport data. A Global Intelligence System would:

  • Serve as the data backbone for monitoring Decade progress
  • Harmonise disparate data sources into a coherent, queryable platform
  • Democratise access so that LDC policymakers have the same evidence base as OECD countries
  • Enable AI-powered analysis to surface insights from complex, multimodal datasets
  • Provide the "Transport Atlas" called for in UN resolutions and conference outcomes

6. Political Momentum and Windows of Opportunity

2026: The Launch Year

  • The Decade formally begins, creating maximum political attention
  • Opportunity to position the Global Intelligence System as a founding initiative
  • UN Secretary-General's report on the Decade will likely highlight data gaps

Key Milestones Ahead

  • 2026: Decade launch events, first Action Plan expected
  • 2027: Expected mid-term review of SDG transport indicators
  • 2028: COP33, continued pressure on transport decarbonisation data
  • 2030: SDG target year -- demand for evidence on transport-related goals
  • 2035: Decade conclusion -- need for comprehensive impact assessment

Alignment with Other Frameworks

  • Paris Agreement: Enhanced NDCs increasingly include transport targets needing monitoring
  • Sendai Framework: Disaster-resilient transport data
  • Addis Ababa Action Agenda: Financing sustainable transport infrastructure
  • Global Biodiversity Framework: Transport infrastructure impact on ecosystems

7. Implications for the Global Intelligence System

The UN Decade creates a unique, time-bound opportunity:

  1. Institutional mandate: UN resolutions specifically call for better transport data systems
  2. Political cover: A decade-long framework provides sustained attention, not a one-off event
  3. Coalition of the willing: Key organisations (WRI, GIZ, TRL, UNECE, UNDESA, Oxford) are already engaged
  4. Funding catalysis: Decade status unlocks funding from multilateral, bilateral, and philanthropic sources
  5. Urgency: The 2030 SDG deadline is approaching with major data gaps still unfilled

The question is not whether a global transport intelligence system is needed -- the Decade's existence confirms that it is. The question is whether the proposed coalition can build it effectively, sustainably, and inclusively.


Document prepared: February 2026 Sources: UN General Assembly resolutions, Beijing Conference outcomes, UNDESA transport programme documentation, UNECE transport statistics framework

AI and LLM Applications in Transport: Use Cases and Opportunities

1. The Emerging Landscape

The convergence of large language models (LLMs), geospatial AI, and open transport data is creating new possibilities for how transport data is accessed, analysed, and acted upon. This document surveys current and emerging applications, with a focus on how they inform the design of a "Transport Angel" chatbot and broader Global Intelligence System.

2. Natural Language Querying of Transport Data

The Core Opportunity

Traditional transport data analysis requires GIS expertise, SQL knowledge, or proprietary software skills. LLMs enable natural language interfaces to complex datasets, dramatically lowering the barrier to evidence-based decision-making.

Current Examples and Approaches

Text-to-SQL and Text-to-API

  • How it works: A user asks a question in plain English (e.g., "What is the average commute time in Nairobi?"). An LLM translates this into a structured query against a database or API, executes it, and returns results in natural language.
  • Precedents:
    • Google BigQuery now supports natural language querying via Gemini integration
    • Microsoft Fabric's Copilot enables conversational queries of analytics datasets
    • Numerous open-source projects (e.g., LangChain SQL agents, LlamaIndex) provide frameworks for building text-to-SQL pipelines
    • Databricks has integrated LLM-based querying into their lakehouse platform

Retrieval-Augmented Generation (RAG) for Transport Documents

  • How it works: Thousands of transport policy documents, reports, and standards are indexed. When a user asks a question, the system retrieves relevant passages and uses an LLM to synthesise an answer with citations.
  • Transport applications:
    • Querying across the corpus of National Urban Mobility Policies
    • Searching UNECE transport regulations and conventions
    • Synthesising findings from transport safety research databases
    • Cross-referencing Nationally Determined Contributions (NDCs) for transport commitments

Conversational Data Exploration

  • Moving beyond single queries to multi-turn conversations where users can refine, drill down, compare, and visualise data iteratively
  • Example flow: "Show me road fatality rates in East Africa" -> "Compare that to Southeast Asia" -> "What are the main contributing factors?" -> "Which countries have seen the biggest improvements?"

Technical Architecture Patterns

User Query (Natural Language)
    |
    v
Intent Classification + Entity Extraction
    |
    v
[Route to appropriate backend]
    |---> Structured Data (SQL/API query)
    |---> Document Corpus (RAG retrieval)
    |---> Geospatial Data (Map query)
    |---> Real-time Feeds (Streaming data)
    |
    v
LLM Synthesis + Citation
    |
    v
Response (Text + Visualisation + Sources)

3. Geospatial AI for Transport

Satellite and Remote Sensing Analysis

  • Road network extraction: AI models (e.g., from Meta/Facebook AI, Microsoft) can now extract road networks from satellite imagery with high accuracy
  • Traffic estimation from space: Emerging techniques use satellite imagery to estimate traffic volumes, particularly valuable where no sensor infrastructure exists
  • Infrastructure condition assessment: ML models detect road surface quality, bridge conditions, and other infrastructure states from imagery
  • Relevance: These techniques could fill massive data gaps in countries where ground-based monitoring is absent

Key Projects and Platforms

  • Google Open Buildings / Roads: AI-derived building footprints and road networks across Africa, South Asia, and Southeast Asia
  • Meta/Facebook Map With AI (RapiD editor): AI-assisted road mapping contributed to OpenStreetMap
  • Microsoft Road Detections: ML-derived road data for underserved regions
  • Mapillary: Street-level imagery platform enabling AI analysis of road infrastructure
  • Development Seed: AI-powered geospatial tools used by development organisations
  • Allen AI / Satlas: Foundation models for satellite imagery analysis

Urban Mobility Pattern Analysis

  • Mobile phone data analytics: Companies like Carto, Unacast, and Veraset analyse anonymised mobile location data to understand travel patterns
  • Transit data platforms: Platforms like Remix (by Via), Conveyal, and TransitMatters use data analytics for transit planning
  • Shared mobility data: MDS (Mobility Data Specification) and GBFS (General Bikeshare Feed Specification) standards enable aggregation of micromobility data

4. AI for Transport Safety

Predictive Safety Analysis

  • Crash prediction models: ML models identifying high-risk road segments based on geometric, traffic, and environmental features
  • iRAP (International Road Assessment Programme): Uses AI and street-level imagery to rate road safety and estimate crash risk globally
  • Video-based conflict analysis: AI systems analysing traffic camera footage to detect near-misses and predict crash hotspots before incidents occur

Natural Language Processing for Safety

  • Crash report analysis: NLP models extracting structured data from unstructured police crash reports
  • News monitoring: AI systems scanning local news for transport safety incidents to supplement official statistics
  • Social media analysis: Detecting transport disruptions and safety concerns from social media posts

5. AI for Transport Planning and Operations

Demand Forecasting and Modelling

  • Traditional transport models (four-step, activity-based) are being augmented with ML techniques
  • Deep learning for travel demand: Neural networks predicting origin-destination flows, mode choice, and route selection
  • Generative AI for scenario planning: LLMs helping planners explore "what if" scenarios in natural language

Optimisation

  • Network design: AI-optimised public transit routes and schedules (e.g., Optibus, Spare Labs)
  • Traffic signal control: Reinforcement learning for adaptive signal timing (e.g., Google's Project Green Light)
  • Freight logistics: Route optimisation, load planning, and supply chain resilience
  • EV charging infrastructure: AI-optimised placement of charging stations based on demand patterns

Real-time Operations

  • Incident detection and response: AI systems detecting incidents from sensor and camera data
  • Predictive maintenance: ML models for transport infrastructure and fleet maintenance scheduling
  • Dynamic pricing: AI-driven congestion pricing and demand management

6. The "Transport Angel" Concept

What It Could Be

A "Transport Angel" chatbot for the Global Intelligence System would serve as the primary interface for diverse users to interact with the world's transport data. The concept draws on several emerging patterns:

User Personas and Use Cases

PersonaExample QueryData Required
LMIC Policymaker"How does our road safety spending compare to peer countries?"Cross-country expenditure data, safety statistics
UN Analyst"Which countries are off-track on SDG 11.2 (public transport access)?"SDG indicator data, trend analysis
City Planner"Show me successful BRT implementations in cities of similar size to ours"Project databases, city characteristics
Climate Negotiator"What transport decarbonisation commitments have been made in NDCs?"NDC database, transport sector analysis
Researcher"What is the evidence base for congestion pricing effectiveness?"Research literature, case study data
Journalist"Which countries have the most dangerous roads?"Safety statistics, comparative indices
Donor/Investor"Where are the biggest gaps in transport infrastructure investment?"Investment data, needs assessments

Technical Design Considerations

Multi-modal Reasoning

The chatbot must integrate across:

  • Structured data: Statistics, indicators, time series
  • Unstructured data: Reports, policies, regulations
  • Geospatial data: Maps, routes, spatial analysis
  • Real-time data: Where available, live feeds and alerts

Multilingual Capability

  • UN working languages minimum (English, French, Spanish, Arabic, Chinese, Russian)
  • Ideally extending to languages of key user communities
  • Current LLMs have strong multilingual capabilities but variable quality

Citation and Transparency

  • Every response must be traceable to source data
  • Confidence levels should be communicated
  • Users must be able to drill down from summary to underlying data
  • Hallucination mitigation is critical for a system used in policy decisions

Guardrails and Accuracy

  • Transport data has real-world consequences (safety, investment, policy)
  • The system must clearly distinguish between data-backed answers and analytical inferences
  • Human-in-the-loop validation for high-stakes queries
  • Regular benchmarking against expert-validated answers

7. Precedents for AI-Powered Data Chatbots

In the Development Sector

  • World Bank DataBot: Experimental chatbot for querying World Bank Open Data
  • UNICEF Magicbox: Data science platform with conversational elements for child welfare data
  • UN Global Pulse: Using AI and big data for development insights
  • HDX (Humanitarian Data Exchange): OCHA's platform, increasingly adding AI-assisted search

In the Commercial Transport Sector

  • Google Maps Platform: Natural language search for transport options, increasingly AI-powered
  • Moovit (Intel): AI-driven transit information across 3,400+ cities
  • Citymapper: AI-powered journey planning with natural language interaction
  • Waze: Crowdsourced traffic intelligence with increasingly sophisticated AI

In Adjacent Sectors

  • Climate TRACE: AI-powered emissions tracking, including transport (see comparable-systems.md)
  • Resource Watch (WRI): Data platform with increasingly AI-assisted exploration
  • Google Earth Engine: Geospatial analysis platform adding AI capabilities

8. Technical Maturity Assessment

CapabilityMaturityReadiness for Transport Angel
Text-to-SQL queryingHighReady for deployment with curated schemas
RAG over document corpusHighReady with proper indexing pipeline
Multilingual LLM interactionMedium-HighGood for major languages, gaps in some
Geospatial query understandingMediumRequires specialised tooling
Real-time data integrationMediumDepends on data availability
Multi-modal reasoning (text+map+chart)MediumImproving rapidly, feasible in 2026
Hallucination preventionMediumRequires careful engineering
Cross-dataset joins/synthesisLow-MediumNovel challenge, research frontier

9. Risks Specific to AI in Transport

Hallucination and Misinformation

  • An AI system that gives incorrect transport safety statistics could lead to misallocated resources
  • Fabricated citations to non-existent reports could undermine trust
  • Mitigation: Strict RAG architecture, source citation requirements, confidence scoring

Bias and Equity

  • LLMs may perform better for data-rich countries than data-poor ones
  • English-language bias in training data could disadvantage non-English speaking users
  • Urban bias in available data could marginalise rural transport needs
  • Mitigation: Deliberate inclusion of LMIC data, multilingual testing, rural data partnerships

Data Privacy

  • Some transport data (e.g., mobile phone-derived mobility data) raises privacy concerns
  • Different jurisdictions have different data protection requirements
  • Mitigation: Privacy-by-design, aggregation thresholds, compliance with GDPR and local equivalents

Dependency and Sustainability

  • Reliance on commercial LLM APIs (OpenAI, Anthropic, Google) creates vendor dependency
  • API costs could become unsustainable at scale
  • Mitigation: Open-source model options, hybrid architecture, cost-sharing model

10. Recommendations

  1. Start with RAG over documents: The most mature and lowest-risk application; index the existing corpus of transport reports and enable natural language querying
  2. Add structured data querying: Build text-to-SQL capabilities against well-curated, standardised datasets
  3. Layer in geospatial intelligence: Partner with satellite/AI organisations for gap-filling in data-poor regions
  4. Design for trust: Every response cited, confidence levels shown, expert validation loops built in
  5. Open architecture: Use open standards, open-source components where possible, avoid lock-in
  6. Iterative deployment: Start with a limited user group (e.g., UN analysts), gather feedback, expand

Document prepared: February 2026 Sources: Industry reports, academic literature on AI in transport, platform documentation, development sector AI initiatives

Comparable Global Intelligence Systems: Precedents from Other Sectors

1. Introduction

The proposed Global Intelligence System for Transport is not without precedent. Several other sectors have built -- or are building -- global data platforms that aggregate disparate sources, create interoperability, and provide analytical tools for policymakers. This document examines the most relevant precedents, extracting lessons for the transport initiative.

2. Health: WHO Global Health Observatory (GHO)

Overview

The WHO Global Health Observatory is the most established example of a UN-led global data platform serving as the definitive source for a sector's statistics.

Architecture

  • Data sources: 194 member states submit data through standardised reporting mechanisms
  • Indicators: 1,000+ health indicators covering mortality, morbidity, health systems, risk factors
  • Data pipeline: National reporting -> WHO verification -> Harmonisation -> Publication
  • User interface: Web portal with visualisations, data downloads, API access
  • Update cycle: Annual for most indicators, with some near-real-time (e.g., disease surveillance)

What Works

  • Institutional mandate: WHO's constitution gives it authority to request health data from member states
  • Standardised classifications: ICD (International Classification of Diseases) provides a universal taxonomy
  • Capacity building: WHO invests in national statistical capacity alongside the platform
  • Political accountability: Health ministers are answerable for their data submissions
  • Long-term funding: Core WHO budget, supplemented by donor contributions

What Struggles

  • Data quality: Many LMICs lack vital registration systems; data is modelled/estimated
  • Timeliness: Typical 2-3 year lag between data collection and publication
  • Interoperability: Despite standards, significant harmonisation effort required
  • User experience: The portal is functional but not intuitive for non-specialists
  • AI integration: Limited; the platform remains largely a traditional data warehouse

Lessons for Transport

  • A mandated reporting mechanism is critical but takes decades to build
  • Taxonomies and classifications (equivalent of ICD for transport) are foundational
  • Estimation and modelling will be necessary where data is absent
  • Consider building AI/chatbot capabilities from the start, not as an afterthought
  • The WHO model shows that a global data platform can achieve authoritative status, but requires sustained institutional commitment

3. Climate: Copernicus Climate Data Store (CDS)

Overview

The Copernicus Climate Data Store, operated by ECMWF (European Centre for Medium-Range Weather Forecasts) for the EU, is perhaps the most technically sophisticated global environmental data platform.

Architecture

  • Data sources: Satellite observations, in-situ measurements, reanalysis datasets, climate model outputs
  • Volume: Petabytes of climate and environmental data
  • Toolbox: Users can process data using a cloud-based Python toolbox without downloading
  • API: Comprehensive programmatic access
  • Update cycle: Near-real-time for some datasets, monthly/annual for others

What Works

  • Compute brought to the data: Users process data in the cloud rather than downloading massive files
  • Consistent reanalysis: ERA5 reanalysis provides gridded global data back to 1940
  • Free and open access: All data freely available under open licences
  • High technical quality: ECMWF's scientific credibility underpins the platform
  • Strong funding: Copernicus programme has multi-billion euro EU commitment

What Struggles

  • Complexity: Steep learning curve for non-climate scientists
  • European governance: Global data, but EU-governed -- tensions around global ownership
  • Developing country access: Bandwidth and technical capacity barriers
  • Limited AI integration: The Toolbox is powerful but requires programming skills

Lessons for Transport

  • The "bring compute to data" model is relevant for large transport datasets
  • Reanalysis-type products could be valuable (e.g., reconstructing historical transport patterns from partial data)
  • The platform shows that open access + high quality can coexist
  • Governance must be genuinely global, not dominated by one region
  • A "Transport Data Store" could follow the CDS architectural model but with a more accessible interface (the Transport Angel chatbot)

4. Climate: Climate TRACE

Overview

Climate TRACE (Tracking Real-time Atmospheric Carbon Emissions) is a coalition using AI and satellite data to independently monitor global greenhouse gas emissions at the facility level.

Architecture

  • Data sources: Satellite imagery, remote sensing, public records, AI inference
  • Coverage: 352 million+ assets tracked globally across all sectors, including transport
  • AI models: Machine learning models estimate emissions from observable indicators (e.g., inferring factory output from heat signatures)
  • Update cycle: Near-real-time, independent of government reporting

What Works

  • Independence from government reporting: Can verify or fill gaps in national inventories
  • AI-native design: Built from the ground up around ML/AI capabilities
  • Granular spatial resolution: Facility-level and road-segment-level data
  • Coalition model: Diverse partners including Al Gore's organisation, WattTime, Carbon Mapper, academic institutions

What Struggles

  • Accuracy debates: Model-derived estimates are contested by some national governments
  • Sustainability: Heavy reliance on philanthropic funding
  • Ground truth validation: Limited ability to verify AI estimates in data-poor regions
  • Political sensitivity: Countries may resist independent monitoring

Lessons for Transport

  • AI-native architecture is viable and can leapfrog traditional data collection
  • Satellite-derived transport data (traffic volumes, fleet composition, infrastructure) is a real possibility
  • The coalition model (multiple organisations contributing AI models) could work for transport
  • Must navigate political sensitivities around independent vs. government-reported data
  • Transport Angel could integrate Climate TRACE transport data as a source

5. Agriculture: CGIAR Platform for Big Data in Agriculture

Overview

CGIAR (formerly the Consultative Group on International Agricultural Research) has built a platform for agricultural data sharing across its 15 research centres and beyond.

Architecture

  • GARDIAN (Global Agricultural Research Data Innovation and Acceleration Network): Metadata search engine across 800,000+ agricultural datasets
  • AgriLAC Data Space: Regional data space for Latin America and the Caribbean
  • Crop modelling platforms: Shared computational tools for agricultural research
  • Ontologies: AGROVOC and other standardised vocabularies

What Works

  • Federated model: Data stays with the source; the platform provides discovery and access
  • Ontologies and metadata standards: AGROVOC provides a multilingual agricultural thesaurus
  • Open access policy: CGIAR's Open Access/Open Data policy mandates sharing
  • Research orientation: Strong link between data platform and research questions

What Struggles

  • Fragmentation: Despite the platform, many datasets remain siloed
  • Data quality variability: Wide variation in quality across sources
  • User adoption: Researchers often prefer familiar tools over the platform
  • Sustainability: Dependent on CGIAR reform and funding cycles

Lessons for Transport

  • A federated model (metadata and access layer over distributed data) may be more realistic than centralisation
  • Transport ontologies and vocabularies are an essential prerequisite
  • Open data policies among partners must be agreed upfront
  • The platform must be useful enough that users prefer it to alternatives
  • GARDIAN's approach of being a discovery layer could inform the Transport Intelligence System architecture

6. Humanitarian: OCHA Humanitarian Data Exchange (HDX)

Overview

The Humanitarian Data Exchange (HDX), run by OCHA's Centre for Humanitarian Data, is a platform for sharing humanitarian data across organisations.

Architecture

  • Open platform: CKAN-based data catalogue
  • Coverage: 20,000+ datasets from 250+ organisations across 250+ locations
  • Data quality framework: Tiered quality assurance process
  • HDX HAPI (Humanitarian API): Standardised API for key humanitarian indicators
  • Data stories: Analytical products built on platform data

What Works

  • Community-driven: Organisations voluntarily contribute data
  • Quality tiers: Clear framework for data quality expectations
  • API-first design: HDX HAPI provides standardised programmatic access
  • Rapid adoption: Has become the go-to platform for humanitarian data
  • Low barrier: Easy to contribute and discover data

What Struggles

  • Voluntary contributions: Completeness depends on willingness of organisations
  • Data sensitivity: Humanitarian data can put vulnerable populations at risk
  • Sustainability: OCHA budget constraints affect the platform
  • Limited analytics: More of a data catalogue than an analytical platform

Lessons for Transport

  • A community-driven, low-barrier approach can achieve rapid adoption
  • Data quality frameworks are essential and should be built in from the start
  • An API-first design enables others to build on the platform
  • The HDX model could work for transport as an initial data-sharing layer
  • The Transport Angel could be the analytical layer on top of an HDX-like data sharing platform

7. Statistics: UN Global Platform for Official Statistics

Overview

The UN Global Platform, hosted by UNSD, provides a cloud-based environment where National Statistical Offices (NSOs) can collaborate on big data for official statistics.

Architecture

  • Cloud infrastructure: Shared computational environment
  • Collaborative workspace: NSOs can share code, methods, and experimental data
  • Pilot projects: Testing use of big data sources (satellite, mobile phone, scanner data) for official statistics
  • Capacity building: Training for NSOs in data science methods

What Works

  • NSO engagement: Builds trust with official statistical community
  • Shared infrastructure: Addresses capacity constraints in poorer NSOs
  • Experimentation: Safe space to test new methods without affecting official statistics
  • UN branding: Gives legitimacy and neutrality

What Struggles

  • Adoption: Uneven uptake across countries
  • Official statistics conservatism: NSOs are risk-averse, slow to adopt new methods
  • Governance: Complex multi-stakeholder governance
  • Limited public-facing products: More of a collaboration tool than a public data platform

Lessons for Transport

  • NSO engagement is important if transport data is to gain official statistics status
  • Shared cloud infrastructure can level the playing field for developing country partners
  • The platform shows the value of collaborative method development
  • Transport data could be a pilot theme within the UN Global Platform

8. Oceans: Global Fishing Watch

Overview

Global Fishing Watch uses satellite data and AI to track fishing activity worldwide, providing transparency on ocean resource use.

Architecture

  • AIS data: Automatic Identification System signals from vessels
  • Satellite imagery: Synthetic aperture radar and optical imagery for detecting vessels
  • AI models: Classification of vessel behaviour (fishing, transiting, anchoring)
  • Open platform: Free access to global maps and data downloads

What Works

  • Radical transparency: Making previously opaque data publicly available
  • AI-native: ML at the core of data production
  • Impact: Used by governments to detect illegal fishing, cited in policy decisions
  • Partnership model: Google, Oceana, SkyTruth as founding partners

What Struggles

  • AIS limitations: Not all vessels transmit AIS; deliberate disabling common
  • Verification: Satellite-derived classifications have error rates
  • Political pushback: Some nations resist external monitoring of their waters
  • Scope creep: Pressure to expand beyond fishing to other maritime activities

Lessons for Transport

  • Satellite + AI + open access can transform transparency in a sector
  • AIS for shipping is analogous to ADS-B for aviation or GTFS for transit -- existing data signals that can be aggregated
  • The model of using AI to classify and interpret raw signals could apply to transport
  • Expect political pushback when making data transparent that was previously opaque
  • Demonstrates that a focused coalition can build a globally significant platform

9. Biodiversity: Global Biodiversity Information Facility (GBIF)

Overview

GBIF is an international network and data infrastructure providing free, open access to biodiversity data.

Architecture

  • Federated network: 1,800+ data publishing institutions, 100+ participating countries
  • Standardised format: Darwin Core standard for biodiversity data
  • Data volume: 2.7 billion+ species occurrence records
  • Governance: Intergovernmental memorandum of understanding
  • Participant funding: Countries fund participation through national nodes

What Works

  • Standardisation: Darwin Core provides a universal data exchange format
  • Massive scale: Decades of consistent effort have built enormous data volume
  • Institutional commitment: Formal government participation
  • Incentive alignment: Researchers incentivised to contribute (data DOIs enable citation credit)

What Struggles

  • Taxonomic and geographic bias: Better data for well-studied groups and regions
  • Data quality: Citizen science data varies widely in quality
  • Funding: Core infrastructure funding perpetually under pressure
  • Timeliness: Much data is historical occurrence records, not real-time

Lessons for Transport

  • Data standards (equivalent of Darwin Core) are the foundation of interoperability
  • Incentive structures must reward data contribution
  • National nodes could be a model for transport data governance
  • Intergovernmental MoU provides legitimacy and commitment
  • Demonstrates that federated, open data can achieve global scale over time

10. Synthesis: What Makes These Systems Succeed or Fail

Success Factors

FactorHealth (WHO)Climate (CDS)Humanitarian (HDX)Biodiversity (GBIF)
Institutional mandateStrong (WHO Constitution)Strong (EU Copernicus)Medium (OCHA)Medium (MoU)
Data standardsStrong (ICD)Strong (CF Conventions)Medium (HDX schemas)Strong (Darwin Core)
Funding sustainabilityStrongStrongWeak-MediumMedium
User adoptionHighMedium-HighHighHigh (research)
AI/ML integrationLowMediumLowLow
Developing country accessMediumLow-MediumHighMedium
Political acceptanceHighHighHighHigh

Critical Success Factors for a Transport System

Based on the evidence from comparable systems:

  1. Data standards are non-negotiable: Every successful system has a foundational data standard. Transport needs its equivalent (building on GTFS, NeTEx, DATEX II, CDS, SIRI, etc.)

  2. Federated beats centralised: Most successful systems are federated (data stays distributed, the platform provides discovery and access) rather than centralised

  3. Institutional mandate matters: The stronger the mandate (ideally from an intergovernmental body), the more sustainable the platform

  4. Open access drives adoption: Platforms that are freely accessible achieve greater scale and impact

  5. AI integration is the frontier: None of the existing systems have deeply integrated AI/LLM capabilities -- this is an opportunity for the transport system to leapfrog

  6. Funding is the Achilles heel: Most platforms struggle with sustainable core funding. The transport system needs a clear business model from the start

  7. Start useful, scale ambitious: Platforms that launched with immediate utility (even if limited) and scaled over time are more successful than those that tried to boil the ocean

  8. Community and incentives: Users must be incentivised to both contribute and use the platform

Based on the analysis, the Global Intelligence System for Transport should consider a layered architecture drawing on the best elements of each precedent:

LayerModelTransport Equivalent
Data StandardsGBIF (Darwin Core)Transport Data Standards Framework (building on GTFS, NeTEx, etc.)
Data SharingHDX (open catalogue)Transport Data Exchange
Federated AccessCGIAR (GARDIAN)Federated query across national/organisational data
Analytical PlatformCopernicus CDS (toolbox)Transport Analysis Workbench
AI/Gap FillingClimate TRACE (satellite+AI)AI-derived transport data for data-poor regions
Intelligence InterfaceNovel (Transport Angel)LLM-powered chatbot for natural language data access
GovernanceGBIF (intergovernmental MoU)Transport Data Governance Framework under UN Decade

Document prepared: February 2026 Sources: Platform documentation, governance frameworks, published evaluations, and academic analyses of each system