I am working on a college assignment and would like to get the revenue and attendance for every Brazilian championship game of the last few years. The CBF publishes this information through a series of links; one example is the Borderô. For similar problems I have used the tabulizer package, as in the code below:
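# Load tabulizer, which extracts tables from text-based PDFs
library(tabulizer)
url <- 'https://conteudo.cbf.com.br/sumulas/2014/1421b.pdf'
# Returns a list with one element per table detected
d <- extract_tables(url, encoding = "UTF-8")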
For tables generated directly as PDF this works perfectly, but for this kind of PDF (which was probably printed, scanned, and then saved as a PDF) it does not work: the code returns a list with 0 elements. Are there any ideas or packages I could use?
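If the file really is a scanned image with no text layer, one direction that may be worth trying is OCR. The following is only a minimal sketch, not a tested solution: it assumes the pdftools and tesseract packages are installed, that Portuguese language data ("por") is suitable for this document, and that 300 dpi is a reasonable rendering resolution. The names pdf_file, png_files, txt and page_lines are illustrative.

```r
# Sketch only: assumes pdftools and tesseract are installed and that the
# PDF pages are scanned images rather than text-based tables.
library(pdftools)
library(tesseract)

url <- 'https://conteudo.cbf.com.br/sumulas/2014/1421b.pdf'

# Download the PDF to a temporary file (mode = "wb" keeps the binary intact)
pdf_file <- tempfile(fileext = ".pdf")
download.file(url, pdf_file, mode = "wb")

# Render each page to a PNG image (written to the working directory by default);
# the dpi value is an assumption, higher values often help OCR
png_files <- pdf_convert(pdf_file, format = "png", dpi = 300)

# Portuguese language data is an assumption; download it once if it is missing
if (!"por" %in% tesseract_info()$available) tesseract_download("por")
por <- tesseract("por")

# Run OCR page by page; the result is one character string per page
txt <- vapply(png_files, function(f) ocr(f, engine = por), character(1))

# Split the first page into lines for inspection and further parsing
page_lines <- strsplit(txt[1], "\n")[[1]]
head(page_lines, 20)
```

The output is plain text, not a table, so the revenue and attendance figures would still have to be located and parsed from the recognized lines (for example with regular expressions), and OCR errors in the numbers should be expected and checked.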