The SmartOpenData infrastructure is depicted in the following figure where three main elements can be identified.
In the lower level the external data sources are depicted. Data sources can be grouped in two different sets. The first one is composed by data sources that fulfil some of the standards supported by SmartOpenData (green boxes). The second group is composed by data sources that does not fulfil those standards (blue boxes). In the upper layer, three different scenarios have being identified: scenario for researches, scenario for companies and scenario for end-users. Each scenario will focus on one specific segment using the functionalities provided by the SmartOpenData System, creating services that take advantage of such data and provide valuable services for each community illustrating how the availability of such services and the corresponding data can provide advantages for them.
Between the external data sources and the data consumer in the scenarios the SmartOpenData System is placed providing key functionalities. The most basic element of the SmartOpenData System is the harmonisation of data sources. This element offers an open data source layer that exposes the external data sources fully adapted to the open data standards supported by the project. If an external data source does not provide the information according to the required standard, and adaptation is required, which is depicted in the figure as an extra box, which provides such adaptation specifically tuned for each external data source. The open data source layer provided both semantic information of the data and data themselves.
Over this open data source layer, three key functionalities are defined:
- (i) distributed semantic indexing, which provides a service for searching and locating data based on semantic information collected from all the available Data Sources;
- (ii) distributed data access, which provides data collected from external data sources, as an extra data source for easier and uniform data gathering from the users at the identified scenarios;
- (iii) administration and notification, which provides administration facilities for managing users, workflows and data to data providers.
These three functional components are communicated and coordinated inside the SmartOpenData System, creating a distributed service system which can be accesses transparently from the scenarios. It is also important to note that it will be possible for services created on the scenarios to access directly external data sources selected thought the distributed semantic indexing functionality of the SmartOpenData System if they are provided using one standard as shown on the previous picture.