C# project, scrape from HTML using XPATH (cut info needed
$30-250 USD
Cerrado
Publicado hace más de 13 años
$30-250 USD
Pagado a la entrega
I need experienced developer on C# (OTHER LANGUAGES NOT ACCEPTED)
I have a set of XPATH expressions and need apply these to string with source code of a webpage
You need take a string(contains source HTML code) and apply these XPATH expressions to get the content. Clean it and give me the result
Remember you need check XPATH & clean all content (delete ads, other buttons like shareit,comments,etc.)
Test it with a large list of sites and give me the list. I will check the process with these and other sites
Extract:
Title
Content
Tags
Clean:
Delete Pieces of code with structure...(XPATH)
Delete Plugins like shareit..
Delete comments,footpost,etc
I have arrays with xpaths that u need test & improve (I have a list for all types: Title,content,tags...etc)
I have this working in php I will show u some pieces of code
I need only Title,content,tags (delete from string all other things)
You need develop these functions to implement on our software, dont worry about connections to get the website, use it on static way...we can make integrations after. To test think about command line with arg name of file with HTML source code
Its an simple job for good C# programmers
Hi there. Just finished a project that involved scrapping data from html pages using C#, sgml reader and xmldocument. All of the extractions was made by xpath queries.