This project is about modelling, designing and implementing a modular and extensible platform to collect, process and store Web data. The platform will have the following components:
(a) Recollection and extraction of Web dataThis platform not only allows to develop faster new Web mining applications, but also allows to repeat the process each time the Web data changes (the Web is intrinsically very dynamic), as views and relations are generated by a sequence of operations in the mentioned algebra.
The project intends to strengthen the training of PhD Students and the capabilities of researchers in the key area of Multimedia Information Retrieval, through the network flows, through extensive use of Internet (setting up collaborative tools); and through the creation of a body of high-quality educational content delivered throuhgh Internet.