Questions tagged as 'scraping'

1
answer

Pick up news from a specific site

At the moment I'm looking for a solution to implement a module in my App to get news from a specific site and show it on the news module screen. Can anyone tell me a way?     
asked by 24.09.2015 / 13:18
1
answer

What does this anti-theft code in Javascript do?

What does this anti-theft code in Javascript do? <html><head></head><body onload="challenge();"> <script> eval(function(p,a,c,k,e,r){e=function(c){return c.toString(a)};if(!''.replace(/^/,String)){while(c--)r[e(c)...
asked by 19.05.2015 / 16:47
1
answer

Protect automated access web pages

How can I protect my web pages from being accessed in an automated way? By search engine bots like Googlebot (I think the basic form was the meta tag with noindex and nofollow). By Headless Browser (browsers without graphical interface and...
asked by 20.05.2015 / 13:57
2
answers

Scraping in Python - read pdf

I've done a scrapping in Python which takes a URL from any PDF, reads and returns, however in some PDFs I'm having the problem come with some characters like this:    ". \ nO \ xc3 \ xb3rg \ xc3 \ xa3o also \ xc3 \ xa9m discloses   result \...
asked by 19.09.2016 / 21:42
2
answers

Web scraping with pure Javascript

I want to do a web scraping that reads an XML page and takes a certain value that is in "name", but I'm not sure exactly if it's possible - I just found out how to do with NodeJS - is it possible to do with pure JS? No external libraries and / o...
asked by 05.11.2015 / 18:29
0
answers

Problems accessing a site via RStudio

Problems connecting to a particular site via RStudio url <- "https://www.jusbrasil.com.br/diarios/busca?q=%22licen%C3%A7a+sem+vencimentos%22&idtopico=T10001849&o=data" links <- read_html(url) %>% html_nodes('.DocumentSnippet'...
asked by 29.05.2018 / 16:52
1
answer

What Threads do you share when making Http calls?

Problem: I can not log in to a site more than once in%% of different%. My application is Threads . If I open multiple executable , each can login successfully on a Console application , only the first . I believe this e...
asked by 26.05.2016 / 01:38
1
answer

Scrapping of HTML with pure Javascript

Hello, I have an html that has the following sample sequence: <links class="canais-teste"> CANAL 1 <links/> There are about 80 excerpts of these, I wanted to get only the contents of these tags, in the case of "CHANNEL1"...
asked by 03.03.2017 / 16:55
0
answers

Scraping with R - xpathSApply returning a list of 0

I'm learning to read data in XML in R. I would like to extract the information of the Brazilian football (name of the championship, game principal, result, etc.) of this site: link with the XML package. My code looks like this: [1] fileUr...
asked by 02.11.2017 / 13:01
1
answer

BeautifulSoup - Real href links

I was studying about WebScraping with Python and started using the bs4 library (BeautifulSoup). When I started to get the tags a and the href attribute, I realized that I could not access the link if in href had something li...
asked by 07.11.2017 / 08:58