Pages tagged parser:

XML Parsing with jQuery | Switch on the Code
http://www.switchonthecode.com/tutorials/xml-parsing-with-jquery

Parsing an XML feed or page using jQuery.
Sexy Lexing with Python | Evan Fosmark
http://www.evanfosmark.com/2009/02/sexy-lexing-with-python/
Lexical analysis, a daunting task right? Wrong! In the following document we'll walk through different methods of lexical scanning in Python. First, we'll look
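As a rough illustration of the kind of lexical scanning the excerpt above refers to, here is a minimal regex-driven tokenizer in Python. The token names, the `TOKEN_SPEC` table, and the `tokenize` function are assumptions made for this sketch and are not taken from the linked article.

```python
import re

# Hypothetical token set for simple arithmetic; not taken from the article.
TOKEN_SPEC = [
    ("NUMBER", r"\d+(?:\.\d+)?"),
    ("IDENT", r"[A-Za-z_]\w*"),
    ("OP", r"[+\-*/=()]"),
    ("SKIP", r"\s+"),
]

# One alternation of named groups; the name of the group that matched
# tells us the token kind.
MASTER_RE = re.compile(
    "|".join(f"(?P<{name}>{pattern})" for name, pattern in TOKEN_SPEC)
)


def tokenize(text):
    """Return a list of (kind, value) pairs; raise ValueError on unknown input."""
    tokens = []
    pos = 0
    while pos < len(text):
        match = MASTER_RE.match(text, pos)
        if match is None:
            raise ValueError(f"unexpected character {text[pos]!r} at position {pos}")
        if match.lastgroup != "SKIP":  # whitespace is matched but not emitted
            tokens.append((match.lastgroup, match.group()))
        pos = match.end()
    return tokens
```

Calling `tokenize("x = 3 + 4.5")` yields identifier, operator, and number tokens in order; a character outside the table raises `ValueError`.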
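Leo's Chronicle: When to Give Up on Regular Expressions
http://leoclock.blogspot.com/2009/01/blog-post_27.html
Parsing tasks that regular expressions cannot handle.
Less about giving up than about using the right tool for the job: regular expressions cope poorly with nesting, so do proper syntactic parsing instead.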
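To make the nesting point in the note above concrete, here is a small recursive-descent sketch in Python that turns arbitrarily nested parentheses into nested lists, something a single regular expression cannot do in general. The function name `parse_nested` and the whitespace-separated input format are assumptions for illustration only.

```python
def parse_nested(text):
    """Parse a string such as "(a (b c) d)" into nested Python lists."""
    # Pad parentheses with spaces so split() separates them from atoms.
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def parse_expr():
        nonlocal pos
        if pos >= len(tokens):
            raise ValueError("unexpected end of input")
        token = tokens[pos]
        pos += 1
        if token == "(":
            items = []
            # Recurse for every element until the matching ")" is reached.
            while pos < len(tokens) and tokens[pos] != ")":
                items.append(parse_expr())
            if pos >= len(tokens):
                raise ValueError("missing closing parenthesis")
            pos += 1  # consume ")"
            return items
        if token == ")":
            raise ValueError("unexpected closing parenthesis")
        return token  # a plain atom

    result = parse_expr()
    if pos != len(tokens):
        raise ValueError("trailing input after expression")
    return result
```

The recursion depth follows the nesting depth of the input, which is exactly the state a regular expression has no way to track.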
tenderlove's nokogiri at master — GitHub
http://github.com/tenderlove/nokogiri/tree/master
better than hpricot ;)
Nokogiri parses and searches XML/HTML faster than Hpricot, and also has correctly implemented CSS3 selector support as well as XPath support.
Nokogiri is one of the fastest Ruby XML parsers. It parses and searches XML and HTML, and supports XPath and CSS3 selectors.
ParseKit - Cocoa Objective-C Framework for parsing, tokenizing and language processing
http://parsekit.com/
ParseKit is a Mac OS X Framework written by Todd Ditchendorf in Objective-C 2.0 and released under the MIT Open Source License. ParseKit is suitable for use on Mac OS X Leopard or iPhone OS. The framework is an Objective-C implementation of the tools described in "Building Parsers with Java" by Steven John Metsker. ParseKit includes additional features beyond the designs from the book and also some changes to match common Cocoa/Objective-C conventions. These changes are relatively superficial, however, and Metsker's book is the best documentation available for ParseKit. The ParseKit Framework offers 3 basic services of general interest to Cocoa developers: String Tokenization via the Objective-C PKTokenizer and PKToken classes. High-Level Language Parsing via Objective-C - An Objective-C parser-building API (the PKParser class and subclasses). Objective-C Parser Generation via Grammars
The ParseKit Framework offers 3 basic services of general interest to Cocoa developers: 1- String Tokenization via the Objective-C PKTokenizer and PKToken classes. 2- High-Level Language Parsing via Objective-C - An Objective-C parser-building API (the PKParser class and subclasses). 3- Objective-C Parser Generation via Grammars - Generate an Objective-C parser for your custom language using a BNF-style grammar syntax (similar to yacc or ANTLR). While parsing, the parser will provide callbacks to your Objective-C code.
Python Package Index : topia.termextract 1.1.0
http://pypi.python.org/pypi/topia.termextract/
This package determines important terms within a given piece of content. It uses linguistic tools such as Parts-Of-Speech (POS) and some simple statistical analysis to determine the terms and their strength.
Term extraction and POS tagging in Python. Wonder how well the POS tagger performs.
Content Term Extraction using POS Tagging This package determines important terms within a given piece of content. It uses linguistic tools such as Parts-Of-Speech (POS) and some simple statistical analysis to determine the terms and their strength.
jParse - jQuery XML Parse Plugin
http://jparse.kylerush.net/
jParse is a jQuery plugin that allows you to quickly and easily parse XML.
Python Package Index : pdfminer 20090330
http://pypi.python.org/pypi/pdfminer/20090330
PDFMiner is a suite of programs that aims to help extract and analyze text data from PDF documents. Unlike other PDF-related tools, it lets you obtain the exact location of text on a page, as well as other layout information such as font size or font name, which can be useful for analyzing the document. It can also be used as a basis for a full-fledged PDF interpreter.
The Magic Behind Parser Combinators - Code Commit
http://www.codecommit.com/blog/scala/the-magic-behind-parser-combinators
If you’re like me, one of the first things that attracted you to Scala was its parser combinators. Well, maybe that wasn’t the first thing for me, but it was pretty far up there. Parser combinators make it almost too easy to create a parser for a complex language without ever leaving the comfortable play-pen afforded by Scala. Incidentally, if you aren’t familiar with the fundamentals of text parsing, context-free grammars and/or parser generators, then you might want to do some reading before you continue with this article.
If you ever want to write a basic parser in Scala, take a look at this.
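The linked article covers Scala's parser combinators. As a language-neutral illustration of the underlying idea, here is a minimal sketch in Python, not Scala's library: each parser is a function from `(text, pos)` to `(value, new_pos)` or `None`, and combinators build larger parsers from smaller ones. All names here (`literal`, `seq`, `alt`, `many`, and the toy `addition` grammar) are assumptions made for this example.

```python
# A parser is a function: (text, pos) -> (value, new_pos) on success, None on failure.

def literal(expected):
    """Match an exact string at the current position."""
    def parse(text, pos):
        if text.startswith(expected, pos):
            return expected, pos + len(expected)
        return None
    return parse


def seq(*parsers):
    """Run parsers one after another; fail if any of them fails."""
    def parse(text, pos):
        values = []
        for parser in parsers:
            result = parser(text, pos)
            if result is None:
                return None
            value, pos = result
            values.append(value)
        return values, pos
    return parse


def alt(*parsers):
    """Try each alternative in order and return the first success."""
    def parse(text, pos):
        for parser in parsers:
            result = parser(text, pos)
            if result is not None:
                return result
        return None
    return parse


def many(parser):
    """Apply a parser zero or more times, collecting the results."""
    def parse(text, pos):
        values = []
        while True:
            result = parser(text, pos)
            if result is None or result[1] == pos:  # stop on failure or empty match
                return values, pos
            value, pos = result
            values.append(value)
    return parse


digit = alt(*(literal(d) for d in "0123456789"))
number = seq(digit, many(digit))
# Hypothetical toy grammar: a number followed by zero or more "+ number" pairs.
addition = seq(number, many(seq(literal("+"), number)))
```

For example, `addition("1+22", 0)` succeeds and consumes all four characters, while `addition("x", 0)` returns `None`.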
need
The If Works » Blog Archive » Talk: Writing a language in 15 minutes
http://blog.jcoglan.com/2009/05/19/talk-writing-a-language-in-15-minutes/
I gave a talk at London Ruby User Group yesterday, based on the work I’ve been doing on Heist, my Scheme interpreter project. I wrote the core of a basic Scheme interpreter in about 15 minutes as a live-coded demo (well, kind of – the coding was pre-recorded so I could focus on talking), which seemed to go down pretty well.
A Scheme interpreter in Ruby, built in 15 minutes.
A talk on how to implement a Scheme interpreter in 15 minutes. The parsing is taken care of primarily by Treetop, a syntactic analysis / parser creation DSL for Ruby.
argparse - Google Code
http://code.google.com/p/argparse/
The argparse module provides an easy, declarative interface for creating command line tools, which knows how to:
* parse the arguments and flags from sys.argv
* convert arg strings into objects for your program
* format and print informative help messages
* and much more...
The argparse module improves on the standard library optparse module in a number of ways including:
* handling positional arguments
* supporting sub-commands
* allowing alternative option prefixes like + and /
* handling zero-or-more and one-or-more style arguments
* producing more informative usage messages
* providing a much simpler interface for custom types and actions
Enhanced command line parsing in Python.
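A short sketch of the declarative style described above, using `argparse` as it ships in the Python standard library today. The program name `demo` and the arguments `paths`, `--count`, and `--verbose` are hypothetical and chosen only for illustration; an explicit argument list is passed instead of reading `sys.argv` so the snippet runs anywhere.

```python
import argparse


def build_parser():
    """Declare the command line interface for a hypothetical 'demo' tool."""
    parser = argparse.ArgumentParser(prog="demo", description="Hypothetical example tool.")
    # Positional argument accepting one or more values.
    parser.add_argument("paths", nargs="+", help="one or more input paths")
    # Optional argument whose string value is converted to int.
    parser.add_argument("-n", "--count", type=int, default=1, help="repeat count")
    # Boolean flag that defaults to False.
    parser.add_argument("-v", "--verbose", action="store_true", help="print extra output")
    return parser


# Parse an explicit list rather than sys.argv.
args = build_parser().parse_args(["a.txt", "b.txt", "--count", "3"])
```

After parsing, `args.paths` holds both file names, `args.count` is the integer 3, and `args.verbose` is `False`.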
cdecl: C gibberish ↔ English
http://cdecl.org/
python-sqlparse - Google Code
http://code.google.com/p/python-sqlparse/
sqlparse is a non-validating SQL parser for Python. It provides support for parsing, splitting and formatting SQL statements.
Support for parsing, splitting and formatting SQL statements.
tinyrb
http://code.macournoyer.com/tinyrb/
"tinyrb is the smallest Ruby VM to date. It won't run all your Ruby code, but it's fast and fun to play with."
wee ruby VM in early development
smallest Ruby VM to date. It won't run all your Ruby code, but it's fast and fun to play with.
PEG.js – Parser Generator for JavaScript
http://pegjs.majda.cz/
enter your own lexer
Codekana blog » Blog Archive » On the Speed of Light, Innovation, and the Future of Parsing
http://www.codekana.com/blog/2009/04/02/on-the-speed-of-light-innovation-and-the-future-of-parsing/
Brilliant
the viemu/codekana guy talks about his incremental parser
Paul Dix Explains Nothing: Feedzirra - A Ruby feed library built for speed
http://www.pauldix.net/2009/02/feedzirra-a-ruby-feed-library-built-for-speed.html
http://feeds.feedburner.com/PaulDixExplainsNothing
another ruby feed library. might not suck.
HTML Parsing and Screen Scraping with the Simple HTML DOM Library | Nettuts+
http://net.tutsplus.com/tutorials/php/html-parsing-and-screen-scraping-with-the-simple-html-dom-library/
If you need to parse HTML, regular expressions aren’t the way to go. In this tutorial, you’ll learn how to use an open source, easily learned parser, to read, modify, and spit back out HTML from external sources. Using nettuts as an example, you’ll learn how to get a list of all the articles published on the site and display them.
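The tutorial above uses PHP's Simple HTML DOM library. The same idea, letting a real HTML parser find the links rather than a regular expression, can be sketched in Python with the standard library's `html.parser` module. The class `LinkCollector` and the helper `extract_links` are hypothetical names for this sketch, and the input is an inline string rather than a page fetched from a site.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (href, link text) pairs from <a href="..."> elements."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None   # href of the <a> currently open, if any
        self._text = []     # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html):
    """Return all (href, text) pairs found in the given HTML string."""
    collector = LinkCollector()
    collector.feed(html)
    collector.close()
    return collector.links
```

Because the parser tracks open and close tags, link text that contains nested markup such as `<b>` is still collected in full, and anchors without an `href` are skipped.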