- #1
John Creighto
I'm trying to create an RSS feed from an HTML table on the main page of a web forum. I want to do this because the table displays the new posts.
I use Yahoo Pipes, and you can see my attempt here:
http://pipes.yahoo.com/pipes/pipe.info?_id=0e72fee43090386fddbc9191f5cddc86
The pipes work up to the regular expression block.
The input to my regular expression block is:
Code:
<a rel="nofollow" target="_blank" href="http://thepeacearch.com/forum/showthread.php?t=16681" title="So, recently I've been looking for ways to make my online time more efficient. One thing I'm looking to do is find ways to combine information from...">My Enviornment rss Feed</a>
<div class="smallfont">
<span style="cursor:pointer;">s243a</span>
</div>
<div class="smallfont">Today <span class="time">02:47 AM</span></div>
<div class="smallfont" style="text-align:right;white-space:nowrap;">
Today <span class="time">02:47 AM</span><br />
by <a rel="nofollow" target="_blank" href="http://thepeacearch.com/forum/member.php?find=lastposter&t=16681">s243a</a> <a rel="nofollow" target="_blank" href="http://thepeacearch.com/forum/showthread.php?p=295884#post295884"><img alt="" border="0" src="http://thepeacearch.com/images/lustrous/buttons/lastpost.gif" title="Go to last post"/></a>
</div>
<span class="smallfont">0</span>
<span class="smallfont">3</span>
<span class="smallfont">0</span>
<span class="smallfont">3</span>
I try to extract the title as follows:
Code:
^.*title=\".*\">(.*)<\/a>.*
the poster as follows:
Code:
^.*<span style=\"cursor\:pointer\;\"(.*)\<\/span\>.*
and the topic description as follows:
Code:
^.*title\=\"(.*)\".*
For each of the above regular expressions, the match is replaced with the contents of the capture group (the part in parentheses). Unfortunately, none of my expressions match. I'm not sure which characters I need to escape, but the syntax is supposed to be based on Perl, and Wikipedia tells me that any non-alphanumeric character in Perl can be escaped with a backslash.
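For comparison, here is a minimal sketch of the same three extractions in Python rather than Yahoo Pipes, run against a shortened copy of the input above. It assumes the Pipes engine treats `.` the way Perl-style engines do by default (not matching newlines unless a "dot matches all" option is set), which is not confirmed for Pipes. The names `html` and `extract` are made up for illustration.

```python
import re

# Shortened copy of the input shown above (hypothetical variable name).
html = (
    '<a rel="nofollow" target="_blank" '
    'href="http://thepeacearch.com/forum/showthread.php?t=16681" '
    'title="So, recently I\'ve been looking for ways...">My Enviornment rss Feed</a>\n'
    '<div class="smallfont">\n'
    '<span style="cursor:pointer;">s243a</span>\n'
    '</div>\n'
)


def extract(pattern, text, flags=0):
    """Return capture group 1 of the first match, or None if nothing matches."""
    m = re.search(pattern, text, flags)
    return m.group(1) if m else None


# The quotes, colon, semicolon, slash, and angle brackets need no escaping here.
# Without DOTALL, "." stops at newlines, so "^.*" cannot reach the third line.
poster_strict = extract(r'^.*<span style="cursor:pointer;">(.*)</span>.*', html)

# With DOTALL, "." also matches newlines; a lazy group keeps the capture short.
poster = extract(r'<span style="cursor:pointer;">(.*?)</span>', html, re.DOTALL)

# Title: the link text that follows the closing quote of the title attribute.
title = extract(r'title="[^"]*">(.*?)</a>', html, re.DOTALL)

# Description: the title attribute itself; [^"]* cannot run past the closing quote.
description = extract(r'title="([^"]*)"', html)

print(poster_strict)  # None
print(poster)         # s243a
print(title)          # My Enviornment rss Feed
print(description)
```

In this sketch the poster pattern also includes the `>` that closes the opening `<span ...>` tag, so the captured text starts at the user name rather than at the bracket.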