I need to read a .pdf file and generate reports based on the data from that file.
I once made a specific system for a graphic that interpreted the responses of the candidates in a public contest. The system interpreted the markings (of the answers) of the students and crossed the data with the test template, generating a report based on this data, etc.
Finally, it happens that "reading" a .pdf, which contains texts, numbers, etc .. is something much more complex.
What is the best way to tackle this type of problem? Someone can help me with some material or any other clue.