Dismiss Notice
Join Physics Forums Today!
The friendliest, high quality science and math community on the planet! Everyone who loves science is here!

Regular Expressions and Yahoo Pipes

  1. Jul 24, 2009 #1
    I'm trying to create an rss feed from a html table on the main page of a web forum. I want to do this because the table displays the new posts.

    I use yahoo pipes and you can see my attempt here:
    http://pipes.yahoo.com/pipes/pipe.info?_id=0e72fee43090386fddbc9191f5cddc86 [Broken]

    The pipes work up to the regular expression block.

    The input to my regular expression block is:

    Code (Text):
    <a rel="nofollow" target="_blank" href="http://thepeacearch.com/forum/showthread.php?t=16681" title="So, recently I've been looking for ways to make my online time more efficient. One thing I'm looking to do is find ways to combine information from...">My Enviornment rss Feed</a>

    <div class="smallfont">

    <span style="cursor:pointer;">s243a</span>


    <div class="smallfont">Today <span class="time">02:47 AM</span></div>


    <div class="smallfont" style="text-align:right;white-space:nowrap;">
    Today <span class="time">02:47 AM</span><br />
    by <a rel="nofollow" target="_blank" href="http://thepeacearch.com/forum/member.php?find=lastposter&amp;t=16681">s243a</a> <a rel="nofollow" target="_blank" href="http://thepeacearch.com/forum/showthread.php?p=295884#post295884"><img alt="" border="0" src="http://thepeacearch.com/images/lustrous/buttons/lastpost.gif" title="Go to last post"/></a>

    <span class="smallfont">0</span>

    <span class="smallfont">3</span>

    <span class="smallfont">0</span>

    <span class="smallfont">3</span>
    I try to extract the title as follows:

    Code (Text):
    the poster as follows;

    Code (Text):
    ^.*<span style=\"cursor\:pointer\;\"(.*)\<\/span\>.*
    and the topic description as follows:

    Code (Text):
    For each of the above regular expressions, the match is replaced with what is inside the brackets. Unfortunately none of my expressions are matching. I'm not sure what characters I need to escape but it is suppose to be based on perl and wikipedia tells me that all non alphanumeri characters in perl can be replaced by a backslash.
    Last edited by a moderator: May 4, 2017
  2. jcsd
  3. Jul 24, 2009 #2
    Well, I don't know about yahoo pipes regexes, but in the absence of documentation (I didn't find any good documentation in a quick search) you might as well try experimenting.
    Can you match:
    h(orse) (replace the orse with something else)
    Last edited: Jul 24, 2009
  4. Jul 27, 2009 #3
    I was able to figure most of what I want out with help from this post:

    http://discuss.pipes.yahoo.com/Message_Boards_for_Pipes/threadview?m=tm&bn=pip-DeveloperHelp&tid=9741&mid=9742&tof=3&rt=2&frt=2&off=1 [Broken]

    I'll I need to do now is figure out how to format the date:

    http://discuss.pipes.yahoo.com/Message_Boards_for_Pipes/threadview?m=tm&bn=pip-DeveloperHelp&tid=9749&mid=9749&tof=1&frt=2 [Broken]
    Last edited by a moderator: May 4, 2017
Share this great discussion with others via Reddit, Google+, Twitter, or Facebook