Parse any text for entries using regular expression. This doesn't work well with HTML, please use html instead.
text:
url: <url>
entry:
<field>: <regexp to match value>
format:
<field>: <python string formatting>
encoding: <encoding of file (defaults to utf-8)>
Format can be used to modify matched values. All fields that were parsed are usable. You can also make temp fields for data if you need.
Note: each entry must have at least two fields, title and url.
text:
url: http://www.nbc.com/Heroes/js/novels.js
# construct our entry with this data
entry:
title: novelTitle = "(.*)"
url: novelPrint = "(.*)"
# further format entry
format:
url: http://www.nbc.com%(url)s