Open the Option Dialog and create a "new html feed". Type the URL of the HTML page and give the new feed a name. Then click the "Parse HTML" button to set the parameters for the HTML parser. You need to know what a regular expression (Reg Exp) is. For the "headline", "article", ... fields at the bottom, type $1, $2, ... depending on the position of the corresponding "()" group in the Reg Exp above. You can combine the $# placeholders with additional text, i.e. "$1". To practice, you can create an HTML feed and set it as:
reg exp: <tr>[^']*'([^']*)[^>]*>([^<]*)</a>[^<]*</b>[^<]*</td>[^<]*</tr>[^<]*<tr>[^<]*<td>[^<]*<small>([^<]*)</small>[^<]*</td>[^<]*</tr>[^<]*<tr>[^<]*<td>([^<]*)
headline : $2
article : $4
published date : $3
link : $1
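
The mapping between the "()" groups and the $1 ... $4 fields can be tried outside the dialog. The following is a minimal sketch only: it uses Python's re module rather than the extension's own engine, the sample HTML is invented to fit the Reg Exp, and the expand helper is a hypothetical stand-in for however inforss substitutes the $# placeholders.

```python
import re

# The Reg Exp from the settings above, split across lines for readability.
ITEM_PATTERN = re.compile(
    r"<tr>[^']*'([^']*)[^>]*>([^<]*)</a>[^<]*</b>[^<]*</td>[^<]*</tr>"
    r"[^<]*<tr>[^<]*<td>[^<]*<small>([^<]*)</small>[^<]*</td>[^<]*</tr>"
    r"[^<]*<tr>[^<]*<td>([^<]*)"
)

# Field settings as typed in the dialog.
FIELDS = {
    "headline": "$2",
    "article": "$4",
    "published date": "$3",
    "link": "$1",
}

# Hypothetical page fragment (an assumption): three table rows per item.
SAMPLE_HTML = """\
<tr><td><b><a href='http://example.com/news/1'>First headline</a></b></td></tr>
<tr><td><small>2006-01-15</small></td></tr>
<tr><td>Body of the first article.</td></tr>
<tr><td><b><a href='http://example.com/news/2'>Second headline</a></b></td></tr>
<tr><td><small>2006-01-16</small></td></tr>
<tr><td>Body of the second article.</td></tr>
"""


def expand(template, match):
    """Replace each $n in template with the text captured by group n."""
    return re.sub(r"\$(\d+)", lambda ref: match.group(int(ref.group(1))), template)


# One dictionary per item found in the page.
items = [
    {name: expand(template, match) for name, template in FIELDS.items()}
    for match in ITEM_PATTERN.finditer(SAMPLE_HTML)
]

for item in items:
    print(item)
```

Each match of the Reg Exp produces one item, and extra text placed around a placeholder (for example "Title: $2") is kept as typed.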
Then click the "Test" button to test your Reg Exp. You can limit parsing to a portion of the HTML code with the "start" and "end" fields (these are also Reg Exps). If you want help building your Reg Exp, locate and select the HTML code in the "source" tab, then click the "Build" button. It will propose a starting point for your Reg Exp.
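
The effect of the "start" and "end" fields can be pictured as cutting the page down before the main Reg Exp runs. This is a rough sketch in Python under assumptions: the limit_region function and the sample page are hypothetical, and whether inforss includes or excludes the matched start and end text is not stated here (this sketch excludes both).

```python
import re


def limit_region(html, start_pattern, end_pattern):
    """Return the text between the first start match and the next end match."""
    start = re.search(start_pattern, html)
    begin = start.end() if start else 0          # no start match: begin at the top
    end = re.search(end_pattern, html[begin:])
    finish = begin + end.start() if end else len(html)  # no end match: run to the end
    return html[begin:finish]


# Hypothetical page: only the table holds the items of interest.
page = (
    "<div id='menu'><a href='/x'>Menu</a></div>"
    "<table id='news'><tr><td>Item</td></tr></table>"
    "<div>Footer</div>"
)

region = limit_region(page, r"<table id='news'>", r"</table>")
print(region)
```

Restricting the region this way keeps links in menus or footers from being picked up as items.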


The inforss project can be contacted through the mailing list or the member list.
Copyright © 2000-2019. All rights reserved. Terms of Use & Privacy Policy.