Manipulating data in PDF Files with a program

Click For Summary

Discussion Overview

The discussion revolves around the challenge of extracting and manipulating data from PDF files for the purpose of plotting gravimetric analysis data as part of a thesis project. Participants explore programming solutions to automate the process of converting PDF tables into a usable format.

Discussion Character

  • Homework-related
  • Exploratory

Main Points Raised

  • The original poster seeks advice on the best programming language to automate the extraction of data from PDF files, specifically mentioning MATLAB for its capabilities in handling TXT files and plotting.
  • Some participants suggest that it may be more efficient to request the data in a different format directly from the partner laboratory, rather than attempting to manipulate PDF files.
  • The original poster expresses a desire to enhance their programming skills through this task, indicating a willingness to learn and explore solutions independently.
  • There is a side conversation about forum etiquette and features, including the use of signature files for symbols, which appears to confuse the original poster.

Areas of Agreement / Disagreement

Participants do not reach a consensus on the best approach to the problem, with some advocating for direct data requests while others focus on programming solutions. The discussion remains unresolved regarding the most effective method for data extraction.

Contextual Notes

The original poster's approach relies on the assumption that MATLAB can interface with Adobe Reader to save PDF files as TXT, which remains unverified. There is also an implicit limitation regarding the availability of data in formats other than PDF.

schrodingerscat11
Messages
86
Reaction score
1

Homework Statement



I am currently doing my thesis, and the gravimetric analysis data I receive from a partner laboratory is in the form of tables in PDF Files. I need to plot the data. It would be tedious if I copy and paste every cell manually, so I decided to automate it using a program.

Here's the instructions I want to implement:
1. Open the PDF File.
2. Save it as a TXT file.
3. Open the data in TXT file (since it has delimeters already.)
4. Sort the data needed.
5. Plot the data.

My question is what is the best programming language that I can use for this task?

Homework Equations





The Attempt at a Solution


I decided to use MATLAB since it can open PDF files, it can manipulate data in TXT files, and it can easily plot that data.

However, I do not know how can MATLAB "instruct" Adobe Reader to save the file as TXT file. Is it even possible with MATLAB or other programming language?
 
Physics news on Phys.org
Not a direct answer to your question, but why don't you arrange for the partner lab to send you the data tables in whatever format they were before they were written to a pdf file?
 
  • Like
Likes   Reactions: 1 person
Thanks for the reply. I would do that in case I don't figure this out. I am just hoping to push my programming skills a little further. :smile: Anyway, I did not understand this part of your message:

Handy symbols: α β γ δ ε ζ η θ ι κ λ μ ν ξ ο ° π ρ ς σ τ υ φ χ ψ ω Ω ~ ≈ ≠ ≡ ± ≤ ≥ Δ ∇ Σ ∂ ∫ ∏ → ∞

Put them in your signature file and they will be there for your use when you preview your posts.

I'm sorry; it's my first time to subscribe to forums. What is a signature file?
 
Thanks for the reply. I would do that in case I don't figure this out. I am just hoping to push my programming skills a little further. :smile: Anyway, I did not understand this part of your message:

Handy symbols: α β γ δ ε ζ η θ ι κ λ μ ν ξ ο ° π ρ ς σ τ υ φ χ ψ ω Ω ~ ≈ ≠ ≡ ± ≤ ≥ Δ ∇ Σ ∂ ∫ ∏ → ∞

Put them in your signature file and they will be there for your use when you preview your posts.

I'm sorry; it's my first time to subscribe to forums. What is a signature file? :shy:
 
physicsjn said:
Thanks for the reply. I would do that in case I don't figure this out. I am just hoping to push my programming skills a little further. :smile: Anyway, I did not understand this part of your message:

Handy symbols: α β γ δ ε ζ η θ ι κ λ μ ν ξ ο ° π ρ ς σ τ υ φ χ ψ ω Ω ~ ≈ ≠ ≡ ± ≤ ≥ Δ ∇ Σ ∂ ∫ ∏ → ∞

Put them in your signature file and they will be there for your use when you preview your posts.

I'm sorry; it's my first time to subscribe to forums. What is a signature file? :shy:


It looks like you have figured it out. :approve:
 

Similar threads

Replies
7
Views
3K
Replies
7
Views
3K
  • · Replies 15 ·
Replies
15
Views
3K
  • · Replies 14 ·
Replies
14
Views
4K
  • · Replies 22 ·
Replies
22
Views
2K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 15 ·
Replies
15
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 41 ·
2
Replies
41
Views
5K