Hi everyone,
I need help forming a regular expression that can extract an article body, for example from articles on ZDNet, Cnews, etc.
I know that the body of an article usually consists of continuous words. Take this article as an example: http://www.zdnet.com/zdnn/stories/news/0,4586,5099135,00.html?chkpt=zdhpnews01
The beginning of the article looks like this:
"Intel will release new chips at the Comdex trade show, its first low-power designs for super-thin servers squeezed into cabinets by the dozens, a source familiar with the plan said."
The ending looks like this:
"A lot of those guys were coming on stream just when the economy decided to take its downward turn," Brookwood said, but also, the power-saving difference just wasn't that big between Intel and Transmeta."
So, how do I extract everything in between, including the start and end blocks of text shown above? 🙂
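To make the fixed-marker case concrete, here is a rough sketch of what I mean, assuming the opening and closing phrases are already known. The function name `extractBetween` and the sample page are made up for illustration; it only uses `preg_quote`, `preg_match` and `strip_tags`.

```php
<?php
// Hypothetical helper: returns the text from $start through $end (inclusive),
// or null when the two markers are not found in that order.
function extractBetween(string $html, string $start, string $end): ?string
{
    // preg_quote() escapes regex metacharacters in the markers;
    // the /s flag lets "." match newlines so the body can span lines.
    $pattern = '/' . preg_quote($start, '/') . '.*?' . preg_quote($end, '/') . '/s';
    return preg_match($pattern, $html, $m) === 1 ? $m[0] : null;
}

// Made-up sample page, not the real ZDNet markup.
$page = '<td>Nav</td><p>Intel will release new chips.</p>'
      . '<p>Brookwood said so.</p><td>Footer</td>';

$body = extractBetween($page, 'Intel will release', 'said so.');
if ($body !== null) {
    // Tags inside the body (<p>, <br>) are removed after extraction.
    echo strip_tags($body), "\n";
}
```

This only works when I know the first and last phrases in advance, which is exactly what I don't have for arbitrary articles.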
So, does anyone know how I can tell PHP to start extracting the moment it sees a continuous stream of text? Characters such as < > , " ' ; should be allowed inside that stream too, since articles sometimes contain paragraph or break tags.
I need a generic solution that works for most articles.
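The "continuous stream of text" idea could perhaps be sketched like this: split the page on block-level tags, strip the remaining markup, and keep the longest run of consecutive blocks that each contain several words. This is only a heuristic under my own assumptions; the function name `guessArticleBody`, the tag list and the `$minWords` threshold of 8 are arbitrary choices, not anything from an existing library.

```php
<?php
// Hypothetical helper: guesses the article body as the longest run of
// consecutive "wordy" text blocks. $minWords is an arbitrary threshold.
function guessArticleBody(string $html, int $minWords = 8): string
{
    // Drop script/style contents, which strip_tags() would leave behind as text.
    $html = preg_replace('#<(script|style)\b[^>]*>.*?</\1>#is', ' ', $html) ?? $html;

    // Treat block-level tags (open or close) as separators between blocks.
    $blocks = preg_split(
        '#<\s*/?\s*(?:p|br|div|td|tr|table|li|ul|ol|h[1-6])\b[^>]*>#i',
        $html
    );

    $best = [];
    $bestWords = 0;
    $run = [];
    $runWords = 0;
    foreach ($blocks as $block) {
        // Remove inline tags, decode entities, collapse whitespace.
        $text = trim(preg_replace('/\s+/', ' ', html_entity_decode(strip_tags($block))));
        if ($text === '') {
            continue; // empty blocks do not break a run
        }
        $words = str_word_count($text);
        if ($words >= $minWords) {
            $run[] = $text;
            $runWords += $words;
            if ($runWords > $bestWords) {
                $best = $run;
                $bestWords = $runWords;
            }
        } else {
            // A short block (menu item, caption) ends the current run.
            $run = [];
            $runWords = 0;
        }
    }
    return implode("\n\n", $best);
}
```

A short paragraph in the middle of an article would cut the run in two, so the threshold would need tuning per site, and a real HTML parser is probably more robust than regular expressions for this.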
Thank you very much 🙂