Analysis of compounds activity concept learned by SVM using robust Jaccard based low-dimensional embedding

2015
journal article
article
1
cris.lastimport.scopus2024-04-07T14:38:54Z
cris.lastimport.wos2024-04-10T02:13:29Z
dc.abstract.enSupport Vector Machines (SVM) with RBF kernel is one of the most successful models in machine learning based compounds biological activity prediction. Unfortunately, existing datasets are highly skewed and hard to analyze. During our research we try to answer the question how deep is activity concept modeled by SVM. We perform analysis using a model which embeds compounds' representations in a low-dimensional real space using near neigh- bour search with Jaccard similarity. As a result we show that concepts learned by SVM is not much more complex than slightly richer nearest neighbours search. As an additional result, we propose a classi cation technique, based on locally sensitive hashing approximating the Jaccard similarity through minhashing technique, which performs well on 80 tested datasets (consisting of 10 proteins with 8 di erent representations) while in the same time allows fast classi cation and ecient online training.pl
dc.affiliationWydział Matematyki i Informatyki : Instytut Informatyki i Matematyki Komputerowejpl
dc.contributor.authorJastrzębski, Stanisław - 207335 pl
dc.contributor.authorCzarnecki, Wojciech - 115076 pl
dc.date.accessioned2016-06-16T12:21:52Z
dc.date.available2016-06-16T12:21:52Z
dc.date.issued2015pl
dc.date.openaccess0
dc.description.accesstimew momencie opublikowania
dc.description.physical9-19pl
dc.description.versionostateczna wersja wydawcy
dc.description.volume24pl
dc.identifier.doi10.4467/20838476SI.15.001.3023pl
dc.identifier.eissn2083-8476pl
dc.identifier.issn1732-3916pl
dc.identifier.projectROD UJ / Ppl
dc.identifier.urihttp://ruj.uj.edu.pl/xmlui/handle/item/28039
dc.languageengpl
dc.language.containerengpl
dc.rightsDodaję tylko opis bibliograficzny*
dc.rights.licenceOTHER
dc.rights.urihttp://ruj.uj.edu.pl/4dspace/License/copyright/licencja_copyright.pdf*
dc.share.typeotwarte czasopismo
dc.subject.ensupport vector machinespl
dc.subject.enlocally sensitive hashingpl
dc.subject.enjaccard similaritypl
dc.subtypeArticlepl
dc.titleAnalysis of compounds activity concept learned by SVM using robust Jaccard based low-dimensional embeddingpl
dc.title.journalSchedae Informaticaepl
dc.typeJournalArticlepl
dspace.entity.typePublication
Affiliations

* The migration of download and view statistics prior to the date of April 8, 2024 is in progress.

Views
0
Views per month