MHB Code language for editing PDF files

AI Thread Summary
To edit around 24 PDF files by deleting the first page or specific text, Python is recommended as the most convenient programming language due to its robust libraries for handling PDFs. For deleting the first page, using a PDF editor or viewer may be the quickest solution. If text manipulation is required, Python libraries like PyPDF2 or PDFMiner can be utilized. Resources such as BeautifulSoup for web scraping are also mentioned, but they are less relevant for direct PDF editing. Overall, Python offers a straightforward approach for both tasks, making it suitable for amateur coders looking to expand their skills.
linag96
Messages
3
Reaction score
0
I have around ~24 PDF files I want to edit. I want to delete the first page of each file OR scan for certain text in the file/delete the text (whichever implementation is easier). What would be the most convenient and straight forward coding language to do this with and what would be my starting guide to creating this program?
I am an amateur coder, I love learning coding. I have about 4 months of experience with C++. I used a lot of MatLab last year for my math classes, I also learned some command line. I'm open to learning anything new. Thank you for your help.
 
Technology news on Phys.org
Like web crawling (or scraping)?

If you are viewing them in browser i guess using Javascript would be the best way to crawl. If not, probably Python.

Some interesting resources here and http://www.nyu.edu/projects/politicsdatalab/localdata/workshops/BeautifulSoup.pdf.

However if you only want to delete the first page of each, then the editor or previewer you are using to view them will likely be the fastest way.
 
Dear Peeps I have posted a few questions about programing on this sectio of the PF forum. I want to ask you veterans how you folks learn program in assembly and about computer architecture for the x86 family. In addition to finish learning C, I am also reading the book From bits to Gates to C and Beyond. In the book, it uses the mini LC3 assembly language. I also have books on assembly programming and computer architecture. The few famous ones i have are Computer Organization and...
I had a Microsoft Technical interview this past Friday, the question I was asked was this : How do you find the middle value for a dataset that is too big to fit in RAM? I was not able to figure this out during the interview, but I have been look in this all weekend and I read something online that said it can be done at O(N) using something called the counting sort histogram algorithm ( I did not learn that in my advanced data structures and algorithms class). I have watched some youtube...
Back
Top