Wednesday, February 10th, 2010
Web-Scale Distributional Similarity and Entity Set Expansion – Patrick Pantel, Eric Crestan, Arkady Borkovsky, Ana-Maria Popescu and Vishnu Vyas. 2009. This paper implements web-scale word similarity metrics and uses them to test performance on a set expansion task. The authors implement a distributed algorithm to compute cosine similarity between context vectors for 500 million terms […]
Wednesday, February 10th, 2010
Entity Extraction via Ensemble Semantics – Pennacchiotti & Pantel (2009) This paper proposes a new framework for information extraction called Ensemble Semantics. The authors describe a knowledge extraction framework that collects information from multiple knowledge sources. They then use multiple knowledge extractors and feature extractors to extract candidate relations and features of relation reliability. They […]
Wednesday, November 11th, 2009
Against Markedness (and What to Replace it With) – Haspelmath (2006) Haspelmath argues that notions of markedness in linguistics are unmotivated. He proposes replacing them with detailed analysis and explanations based on textual frequency. He systematically examines 12 different uses of markedness throughout the linguistic literature, showing in each case why […]