Publication
Scalable modelling and recommendation using wiki-based crowdsourced repositories
| dc.contributor.author | Leal, Fátima | |
| dc.contributor.author | Veloso, Bruno | |
| dc.contributor.author | Malheiro, Benedita | |
| dc.contributor.author | González-Veléz, Horacio | |
| dc.contributor.author | Burguillo, Juan Carlos | |
| dc.date.accessioned | 2019-03-12T16:37:36Z | |
| dc.date.embargo | 2119 | |
| dc.date.issued | 2019 | |
| dc.date.updated | 2019-03-08T15:50:56Z | |
| dc.description.abstract | Wiki-based crowdsourced repositories have increasingly become an important source of information for users in multiple domains. However, as the amount of wiki-based data increases, so does the information overloading for users. Wikis, and in general crowdsourcing platforms, raise trustability questions since they do not generally store user background data, making the recommendation of pages particularly hard to rely on. In this context, this work explores scalable multi-criteria profiling using side information to model the publishers and pages of wiki-based crowdsourced platforms. Based on streams of publisher-page-review triads, we have modelled publishers and pages in terms of quality and popularity using different criteria and user-page-view events collected via a wiki platform. Our modelling approach classifies statistically, both page-review (quality) and page-view (popularity) events, attributing an appropriate rating. The quality-related information is then merged employing Multiple Linear Regression as well as a weighted average. Based on the quality and popularity, the resulting page profiles are then used to address the problem of recommending the most interesting wiki pages per destination to viewers. This paper also explores the parallelisation of profiling and recommendation algorithms using wiki-based crowdsourced distributed data repositories as data streams via incremental updating. The proposed method has been successfully evaluated using Wikivoyage, a tourism crowdsourced wiki-based repository. | pt_PT |
| dc.description.version | info:eu-repo/semantics/publishedVersion | pt_PT |
| dc.identifier | 15674223 | en_US |
| dc.identifier.doi | 10.1016/j.elerap.2018.11.004 | pt_PT |
| dc.identifier.issn | 15674223 | |
| dc.identifier.uri | http://hdl.handle.net/10400.22/12974 | |
| dc.language.iso | eng | pt_PT |
| dc.publisher | Elsevier | pt_PT |
| dc.relation.publisherversion | https://www.sciencedirect.com/science/article/pii/S1567422318300826?via%3Dihub | pt_PT |
| dc.subject | Modelling | pt_PT |
| dc.subject | Scalable data mining | pt_PT |
| dc.subject | Wiki-based crowdsourcing | pt_PT |
| dc.subject | Parallel processing | pt_PT |
| dc.subject | Reputation | pt_PT |
| dc.subject | User profiling | pt_PT |
| dc.subject | Cloud computing | pt_PT |
| dc.subject | Recommender systems | pt_PT |
| dc.title | Scalable modelling and recommendation using wiki-based crowdsourced repositories | pt_PT |
| dc.type | journal article | |
| dspace.entity.type | Publication | |
| oaire.citation.title | Electronic Commerce Research and Applications | pt_PT |
| oaire.citation.volume | 33 | pt_PT |
| person.familyName | BENEDITA CAMPOS NEVES MALHEIRO | |
| person.givenName | MARIA | |
| person.identifier.ciencia-id | 7A15-08FC-4430 | |
| person.identifier.orcid | 0000-0001-9083-4292 | |
| rcaap.rights | closedAccess | pt_PT |
| rcaap.type | article | pt_PT |
| relation.isAuthorOfPublication | babd4fda-654a-4b59-952d-6113eebbb308 | |
| relation.isAuthorOfPublication.latestForDiscovery | babd4fda-654a-4b59-952d-6113eebbb308 |
