Datasets & Data Vault
Iberia Travel Data Vault
Source registry, dataset notes, and transparent assumptions behind Odyssey Discoveries’ Spain and Portugal route comparisons.
Independent research. Source-aware analysis. Transparent assumptions.
Table of Contents
ToggleWhat the Data Vault is for
The Odyssey Discoveries Data Vault documents the sources, assumptions, and dataset structures used to support route reports, research briefs, and decision tools across Iberia.
This page is designed for readers who want to understand where the route data comes from, how assumptions are labelled, and which datasets are available, planned, or used internally to support travel comparisons.
Iberia-first route research, with city-pair comparisons and transport-mode assumptions.
Datasets are used to support route comparisons, not to claim live booking or operational data.
Routes and datasets should be reviewed when timetables, fares, operators, or assumptions change.
Assumptions should be visible, labelled, and linked to methodology where possible.
Dataset Catalog
Core datasets and research tables
Iberia Route Baseline Dataset
Structured city-pair table for Spain and Portugal routes, including origin city, destination city, main transport modes, likely stations or airports, and route-report status.
- Route ID
- Origin and destination city
- Available modes
- Research status
Carbon Assumptions Table
Notes on emissions assumptions used in route comparisons, including transport mode, distance logic, emissions-factor source, and confidence level.
- Mode category
- Emissions-factor source
- Distance basis
- Estimate quality
Airport & Station Access Assumptions
Working table for access-time assumptions, airport buffers, station friction, transfer penalties, and local-arrival notes used in route reports.
- Airport or station
- Access assumption
- Buffer assumption
- Friction notes
Source Registry
A curated directory of official and supporting sources used to verify transport, emissions, airport, rail, mapping, and route-comparison assumptions.
- Official transport sources
- Emissions references
- Geographic references
- Update notes
Source Directory
Primary source categories
Rail and timetable sources
Sources used for rail schedules, station references, and timetable checks across Spain and Portugal.
Airport and aviation sources
Sources used for airport passenger data, airport-level context, and aviation traffic references.
Transport statistics
Sources used for wider transport context, rail and air passenger trends, and country-level transport indicators.
Carbon and emissions sources
Sources used to support transport-emissions context and emissions-factor assumptions.
Geographic and route context
Sources used for mapping, distance checks, route geography, and station or airport location context.
Internal Odyssey sources
Internal research notes used to keep route pages consistent across time, cost, carbon, friction, and reliability.
Data Quality Rules
How data is treated before it appears in a route report
Official sources first
Timetables, airport statistics, national transport data, and emissions references are preferred from official or clearly documented sources whenever available.
Dates are part of the data
Route analysis should show when a page or assumption was last reviewed. Older data should be labelled rather than presented as current.
Assumptions are labelled
If a time, cost, carbon, or friction estimate relies on judgement, the assumption should be stated clearly.
No false live-data claims
Unless explicitly connected to a live data source, Odyssey Discoveries should not claim real-time fares, real-time delays, or live availability.
Data Dictionary
Recommended fields for route datasets
| Field | Meaning | Example |
|---|---|---|
| route_id | Unique route identifier used internally. | mad-bar |
| origin_city | Starting city for the route comparison. | Madrid |
| destination_city | Destination city for the route comparison. | Barcelona |
| mode | Transport option being compared. | Train, flight, bus |
| scheduled_time_min | Published or estimated scheduled travel time in minutes. | 150 |
| access_time_min | Estimated time to reach station or airport. | 35 |
| buffer_time_min | Recommended buffer for check-in, security, boarding, or connection risk. | 60 |
| transfer_count | Number of major transfers in the route. | 1 |
| carbon_source | Source or assumption used for emissions estimate. | EEA / UK GHG factor / route assumption |
| estimate_quality | Confidence label for the estimate. | Official, derived, estimated |
| last_checked | Date the source or assumption was last reviewed. | 2026-05-06 |
FAQ
Data Vault FAQ
Are all datasets downloadable?
Not yet. Some datasets are available as visible source notes or methodology tables, while others are still being prepared. The page should not claim downloadable files until a CSV, spreadsheet, or PDF is actually published.
Is this live transport data?
No. Unless a page explicitly says otherwise, Odyssey Discoveries uses source-backed research and route assumptions rather than live fares, live delays, or live booking availability.
How are emissions assumptions handled?
Carbon estimates are treated as decision-support figures. They should show the source, transport mode, distance logic, and whether the estimate is official, derived, or approximate.
Why does the Data Vault include external sources?
Route comparisons are stronger when readers can see the source trail. The Data Vault helps users understand which official or supporting references inform the analysis.
Use the data behind smarter route decisions
Explore the methodology or compare a real Iberia route using time, cost, carbon, and travel-friction assumptions.
Notifications