C# HTML Agility Pack XPath Issues
To be honest, I'm new to HTML Agility Pack and have hit a stump in the road. My goal is to parse data out of an HTML page. When I iterate through the divs I need, each time I pull data from a div it keeps checking the whole document instead of looking only inside that div. Sorry if this is a dumb question; I come from regex, I'm having issues, and I have a lot of questions about parsing HTML (lol, I used to parse HTML with regex). One more thing: if you guys would be kind enough to post below any sites you'd recommend for helping me learn HTML Agility Pack, that would be fantastic!
Edit: I forgot to mention that when I select the individual nodes below, I did try // instead of ., but had no luck at all...
Edit 2: I removed the HTML page because I know for a fact that I'm able to access the data. What I'm wondering is how to search inside an element instead of searching the whole document.
This is my code:
// grab daily bulletin
foreach (HtmlNode hn_post in hd.DocumentNode.SelectNodes("//div[@class='newspostitem']"))
{
    HtmlNode hn_post_title = hn_post.SelectSingleNode(".div[@class='newsposttitle']");
    HtmlNode hn_post_date = hn_post.SelectSingleNode(".div[@class='newspubdate']");
    HtmlNode hn_post_notes = hn_post.SelectSingleNode(".div[@class='newspostnotes']");

    string final = string.Format("title - {0} | date - {1} | body - {2}", hn_post_title.InnerText, hn_post_date.InnerText, hn_post_notes.InnerText);
    final = final.Replace("\n", string.Empty);
    final = final.Replace("\r", string.Empty);
    final = final.Replace("\t", string.Empty);

    main_listbox.Items.Add(final);
}
SelectNodes() and SelectSingleNode() accept an XPath version 1.0 expression as their parameter. You can learn XPath separately by following one of the many tutorials on the internet, and you can use one of the many online XPath testers to run and test your XPath.
Now, for your specific question: .div is not a valid XPath expression. The correct XPath query for a child element named div whose class attribute value equals "newsposttitle", relative to the current context element, is either of the following:

    ./div[@class='newsposttitle']
    div[@class='newsposttitle']
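To make this concrete, here is a minimal sketch of the loop from the question rewritten with relative XPath. It assumes the HtmlAgilityPack NuGet package is referenced and that the title, date, and notes divs are direct children of each newspostitem div. The names BulletinParser, ParsePosts, and Clean are hypothetical and only there for illustration.

```csharp
using System.Collections.Generic;
using HtmlAgilityPack; // assumption: the HtmlAgilityPack NuGet package is referenced

// Hypothetical helper class, named for illustration only.
public static class BulletinParser
{
    // Returns one formatted line per post found in the given HTML.
    public static List<string> ParsePosts(string html)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html);

        var results = new List<string>();

        // Absolute query: "//" searches the whole document, which is what we want here.
        HtmlNodeCollection posts = doc.DocumentNode.SelectNodes("//div[@class='newspostitem']");
        if (posts == null)
        {
            return results; // SelectNodes returns null when nothing matches
        }

        foreach (HtmlNode post in posts)
        {
            // Relative queries: "./div" looks only at direct children of the current post.
            // If the target divs were nested deeper, ".//div[...]" would search
            // all descendants of the current post instead.
            HtmlNode title = post.SelectSingleNode("./div[@class='newsposttitle']");
            HtmlNode date = post.SelectSingleNode("./div[@class='newspubdate']");
            HtmlNode notes = post.SelectSingleNode("./div[@class='newspostnotes']");

            results.Add(string.Format("title - {0} | date - {1} | body - {2}",
                Clean(title), Clean(date), Clean(notes)));
        }

        return results;
    }

    // Returns the node's text without newlines or tabs; empty string if the node was not found.
    private static string Clean(HtmlNode node)
    {
        if (node == null)
        {
            return string.Empty;
        }

        return node.InnerText
            .Replace("\n", string.Empty)
            .Replace("\r", string.Empty)
            .Replace("\t", string.Empty)
            .Trim();
    }
}
```

By contrast, calling post.SelectSingleNode("//div[@class='newsposttitle']") inside the loop starts from the document root again, so it would return the first title in the whole document on every iteration. That matches the "keeps checking the whole document" behaviour described in the question.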