• norsk
    • English
  • English 
    • norsk
    • English
  • Login
View Item 
  •   Home
  • Faculty of Social Sciences
  • Department of Information Science and Media Studies
  • Department of Information Science and Media Studies
  • View Item
  •   Home
  • Faculty of Social Sciences
  • Department of Information Science and Media Studies
  • Department of Information Science and Media Studies
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Word sense disambiguation in webpages. Developing a program capable to disambiguate words with a website text as context

Sekkingstad, Andreas
Master thesis
Thumbnail
View/Open
150483671.pdf (1.090Mb)
URI
http://hdl.handle.net/1956/15606
Date
2016-12-01
Metadata
Show full item record
Collections
  • Department of Information Science and Media Studies [826]
Abstract
This master thesis investigated automatic methods of Word Sense Disambiguation (WSD) in HTML pages. The hypothesis was that HTML documents provide various disambiguation cues which are not normally present in general text, and which can enhance the quality of WSD. We tested several existing natural language processing toolkits which provide general WSD services, and compared these to our novel algorithms which were designed to take advantage of the HTML cues. The findings showed that our new algorithms outperformed state of the art general WSD implementations. In addition, our algorithm could provide a ranked list of potential disambiguations, which is useful in an example use case where users “tag” key words in a web page with the help of the disambiguating algorithm.
Publisher
The University of Bergen
Copyright
Copyright the author. All rights reserved.

Contact Us | Send Feedback

Privacy policy
DSpace software copyright © 2002-2019  DuraSpace

Service from  Unit
 

 

Browse

ArchiveCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsDocument TypesJournalsThis CollectionBy Issue DateAuthorsTitlesSubjectsDocument TypesJournals

My Account

Login

Statistics

View Usage Statistics

Contact Us | Send Feedback

Privacy policy
DSpace software copyright © 2002-2019  DuraSpace

Service from  Unit