Supply chain data: One huge denormalized table with one line per product flow between locations. These type of datasets, though format specific to Sonae, are general data sets for the retail sector. All the datasets are created in our operational systems, collected in our on premises data warehouse, and made available to 3rd parties through Amazon AWS S3/Redshift.
Data Provider Country
The dataset used in the experiment will have a bespoken update frequency to be decided with the challenge winner.
20TB of compressed data (1/10 ratio)
Number of attributes
Format and storage
Csv files stored in Amazon AWS
- Supply chain data – One denormalized table with one line per product flow between locations
No data relating to persons present
No Synthetic data present
Timespan & Production
Timespan: Jan 2017 – present
Level of aggregation