Simple view
Full metadata view
Authors
Statistics
Classifying the Wikipedia articles into the OpenCyc taxonomy
This article presents a method of classification of the Wikipedia articles into the taxonomy of OpenCyc. This method utilises several sources of the classification information, namely the Wikipedia category system, the infoboxes attached to the articles, the first sentences of the articles, treated as their definitions and the direct mapping between the articles and the Cyc symbols. The classification decision made using these methods are accommodated using the Cyc built-in inconsistency detection mechanism. The combination of the best classification methods yields 1,47 millions of classified articles and has a manually verified precision above 97%, while the combination of all of them yields 2.2 millions of articles with estimated precision of 93%.
dc.abstract.en | This article presents a method of classification of the Wikipedia articles into the taxonomy of OpenCyc. This method utilises several sources of the classification information, namely the Wikipedia category system, the infoboxes attached to the articles, the first sentences of the articles, treated as their definitions and the direct mapping between the articles and the Cyc symbols. The classification decision made using these methods are accommodated using the Cyc built-in inconsistency detection mechanism. The combination of the best classification methods yields 1,47 millions of classified articles and has a manually verified precision above 97%, while the combination of all of them yields 2.2 millions of articles with estimated precision of 93%. | pl |
dc.affiliation | Wydział Zarządzania i Komunikacji Społecznej : Katedra Lingwistyki Komputerowej | pl |
dc.conference | 11th International Semantic Web Conference (ISWC 2012) | pl |
dc.conference.city | Boston | |
dc.conference.country | USA | |
dc.conference.datefinish | 2012-11-11 | |
dc.conference.datestart | 2012-11-11 | |
dc.contributor.author | Smywiński-Pohl, Aleksander - 173398 | pl |
dc.contributor.editor | Rizzo, Giuseppe | pl |
dc.contributor.editor | Mendes, Pablo | pl |
dc.contributor.editor | Charton, Eric | pl |
dc.contributor.editor | Hellmann, Sebastian | pl |
dc.contributor.editor | Kalyanpur, Aditya | pl |
dc.date.accession | 2016-05-10 | pl |
dc.date.accessioned | 2016-05-12T05:55:46Z | |
dc.date.available | 2016-05-12T05:55:46Z | |
dc.date.issued | 2012 | pl |
dc.date.openaccess | 0 | |
dc.description.accesstime | w momencie opublikowania | |
dc.description.additional | Publikacja nie posiada numeru ISBN. Bibliogr. s. 15-16 | pl |
dc.description.conftype | international | pl |
dc.description.physical | 5-16 | pl |
dc.description.publication | 0,8 | pl |
dc.description.series | CEUR workshop proceedings | |
dc.description.seriesnumber | 906 | |
dc.description.version | ostateczna wersja wydawcy | |
dc.identifier.seriesissn | 1613-0073 | |
dc.identifier.uri | http://ruj.uj.edu.pl/xmlui/handle/item/25546 | |
dc.identifier.weblink | http://ceur-ws.org/Vol-906/paper2.pdf | pl |
dc.language | eng | pl |
dc.language.container | eng | pl |
dc.pubinfo | Aachen : Technical University of Aachen | pl |
dc.rights.licence | OTHER | |
dc.share.type | inne | |
dc.subtype | ConferenceProceedings | pl |
dc.title | Classifying the Wikipedia articles into the OpenCyc taxonomy | pl |
dc.title.container | Proceedings of the Web of Linked Entities Workshop in conjunction with the 11th International Semantic Web Conference (ISWC 2012) | pl |
dc.type | BookSection | pl |
dspace.entity.type | Publication |