Problem with using fopen

6 April 2021

Updated 6 April 2021

TestCOA.pdf

The goal is not just to get the words from a PDF, as extractFileText(filename) does, but also the position of each sentence. My approach is to read the PDF and then FlateDecode it to obtain this information. After decoding, the information can look like this:

I found a Python script* that works, and I want to translate it into MATLAB.

...here comes the problem

Python:

pdf = open("TestCOA.pdf","rb").read() <-- Python reads the file perfectly

MATLAB:

fileID = fopen("TestCOA.pdf",'rb','n','us-ascii');

A = fscanf(fileID,'%c') <-- reads some characters, but mixed with invalid characters <?>

pdf=py.open("TestCOA.pdf","rb").read() <-- same result with the Python integration syntax
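To illustrate what might be happening, here is a minimal Python sketch. It assumes the invalid characters come from interpreting raw bytes as 7-bit text; that is a guess, not a confirmed diagnosis. The `raw` bytes are a synthetic stand-in, not the contents of TestCOA.pdf, and `count_high_bytes` is a hypothetical helper. If the guess holds, reading numeric bytes on the MATLAB side (for example `fread(fileID, Inf, '*uint8')`) instead of characters may avoid the substitution.

```python
# Synthetic stand-in for binary file content (not the real TestCOA.pdf).
raw = b"%PDF-1.4\n" + bytes([0x78, 0x9C, 0xE2, 0xFF]) + b"\nendstream"


def count_high_bytes(data: bytes) -> int:
    """Count bytes that have no representation in 7-bit ASCII."""
    return sum(1 for b in data if b >= 0x80)


# Binary semantics: every byte survives unchanged as an integer 0..255.
as_bytes = list(raw)

# Text semantics with a 7-bit encoding: bytes >= 0x80 cannot be decoded and
# are replaced by U+FFFD, which is one plausible source of the <?> characters.
as_text = raw.decode("ascii", errors="replace")

print(count_high_bytes(raw), "bytes fall outside ASCII")
print(as_text.count("\ufffd"), "replacement characters after decoding")
```

Once a byte has been replaced this way, the original value is lost, so a compressed stream read as text can no longer be inflated.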

I uploaded an example PDF to try it out. I hope someone can help me figure this out. :)

*The full Python script: https://gist.github.com/averagesecurityguy/ba8d9ed3c59c1deffbd1390dafa5a3c2
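For reference, the FlateDecode step I am trying to port can be sketched roughly as follows. This is a minimal sketch written for this post, not a copy of the linked script: the regular expression is a rough heuristic rather than a real PDF parser, and `inflate_streams`, `STREAM_RE`, `payload` and `sample` are names made up for the example. The demo runs on a synthetic object, not on TestCOA.pdf.

```python
import re
import zlib

# Captures the bytes between "stream" and "endstream" for objects whose
# dictionary mentions FlateDecode. A rough heuristic, not a PDF parser.
STREAM_RE = re.compile(rb"FlateDecode.*?stream\r?\n(.*?)endstream", re.S)


def inflate_streams(pdf_bytes: bytes) -> list[bytes]:
    """Return the decompressed payload of every FlateDecode stream found."""
    decoded = []
    for match in STREAM_RE.finditer(pdf_bytes):
        try:
            # decompressobj tolerates the end-of-line bytes that sit
            # between the compressed data and the "endstream" keyword.
            decoded.append(zlib.decompressobj().decompress(match.group(1)))
        except zlib.error:
            # Skip streams that are not plain zlib data.
            continue
    return decoded


# Made-up content stream: "72 720 Td" sets the text position, "Tj" shows text.
payload = b"BT /F1 12 Tf 72 720 Td (Hello) Tj ET"
sample = (
    b"1 0 obj\n<< /Filter /FlateDecode >>\nstream\n"
    + zlib.compress(payload)
    + b"\nendstream\nendobj\n"
)

for content in inflate_streams(sample):
    print(content.decode("latin-1"))
```

The input has to be the untouched bytes of the file; any character substitution before this step corrupts the compressed data.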
