Tropo / Dave / Bookmarks : nlp

Home - GitHub
    Project Goose is an article extractor written in Java using Maven for the dependencies. It's an open source project born from Gravity Labs http://gravity.com, Its goal is to take a webpage, perform calculations and extract the main text of the article as well as make recommendations on what image might be the most relevant image on the page. 
    https://github.com/jiminoc/goose/wiki
    tags: nlp ir

Treebank tokenization
    Our tokenization is fairly simple: most punctuation is split from adjoining words double quotes (") are changed to doubled single forward- and backward- quotes (`` and '') verb contractions and the Anglo-Saxon genitive of nouns are split into their component morphemes, and each morpheme is tagged separately. Examples children's --> children 's parents' --> parents ' won't --> wo n't gonna --> gon na I'm --> I 'm
    http://www.cis.upenn.edu/~treebank/tokenization.html
    tags: nlp tokenization

LingPipe: Competition
    On this page, we break our competition down into academic toolkits and industrial toolkits. We only consider software that is available for linguistic processing, not companies that rely on linguistic processing in an application but do not sell that technology. How does LingPipe compare to the below offerings?
    http://alias-i.com/lingpipe/web/competition.html
    tags: ir nlp

Statistical NLP / corpus-based computational linguistics res...
    http://www-nlp.stanford.edu/links/statnlp.html
    tags: ir nlp

montylingua :: a free, commonsense-enriched natural language...
    MontyLingua is a free*, commonsense-enriched, end-to-end natural language understander for English. Feed raw English text into MontyLingua, and the output will be a semantic interpretation of that text. Perfect for information retrieval and extraction, request processing, and question answering. From English sentences, it extracts subject/verb/object tuples, extracts adjectives, noun phrases and verb phrases, and extracts people's names, places, events, dates and times, and other semantic information. MontyLingua makes traditionally difficult language processing tasks trivial!
    http://web.media.mit.edu/~hugo/montylingua/index.html
    tags: python nlp

ConceptNet
    What is ConceptNet? [top] ConceptNet is a freely available commonsense knowledgebase and natural-language-processing toolkit which supports many practical textual-reasoning tasks over real-world documents right out-of-the-box (without additional statistical training) including ...
    http://web.media.mit.edu/~hugo/conceptnet/
    tags: nlp cyc wordnet python

Grand Text Auto » The Story of Meehan’s Tale-Spin
    http://grandtextauto.org/2006/09/13/the-story-of-meehans-tal...
    tags: talespin uci meehan nlp storygeneration

The Stanford NLP (Natural Language Processing) Group
    Named Entity Recognition (NER) and Information Extraction (IE)
    http://nlp.stanford.edu/ner/index.shtml
    tags: nlp ir ie namedentity

About TextTiling
    http://people.ischool.berkeley.edu/~hearst/tiling-about.html
    tags: ir nlp segmentation tokenization passage

Kea - keyphrase extraction
    KEA is an algorithm for extracting keyphrases from text documents. It can be either used for free indexing or for indexing with a controlled vocabulary.
    http://www.nzdl.org/Kea/
    tags: ir nlp search kea

MULTI-PARAGRAPH SEGMENTATION OF EXPOSITORY TEXT
    his paper describes TextTiling, an algorithm for partitioning expository texts into coherent multi-paragraph discourse units which reflect the subtopic structure of the texts. The algorithm uses domain-independent lexical frequency and distribution inform
    http://people.ischool.berkeley.edu/~hearst/papers/tiling-acl...
    tags: nlp ir

Main Page - NLTK
    NLTK — the Natural Language Toolkit — is a suite of open source Python modules, data and documentation for research and development in natural language processing. NLTK contains Code supporting dozens of NLP tasks, along with 40 popular Corpora and ex
    http://nltk.org/index.php/Main_Page
    tags: python Linguistics nlp

 


Search for nlp on del.icio.us