Create multiple subtables from multiple .tsv tables

Question

julian gaviria 2025 年 1 月 29 日 14:05

0
リンク

この質問への直接リンク

https://jp.mathworks.com/matlabcentral/answers/2173447-create-multiple-subtables-from-multiple-tsv-tables

移動済み: Voss 2025 年 1 月 30 日 23:14

I have 120 .tsv files (see example example in "sub-m0001_file.tsv"). The path is the same for all the files except in the 9th folder. See the paths for the first two .tsv files below:

/f1/f2/f3/f4/f5/f6/f7/f8/sub-m0001/f10/f11/file.tsv

/f1/f2/f3/f4/f5/f6/f7/f8/sub-m0002/f10/f11/file.tsv

How can I get subtables (i.e., 1 table per file) including only the following six columns: 'trans_x', 'trans_y', 'trans_z', 'rot_x', 'rot_y', 'rot_z'?

The following code does it only for the first .tsv file. Any hint to go recursively over the 120 .tsv files?

mat = dir('/f1/f2/f3/f4/f5/f6/f7/f8/sub-m*/f10/f11/file.tsv');
for files_i = 1:length(mat)
    data = fullfile(mat(files_i).name); 
    x = readtable(data,"FileType","text",'Delimiter', '\t');
    vars = {'trans_x' 'trans_y' 'trans_z' 'rot_x' 'rot_y' 'rot_z'};
    new_x = x(:,vars);
end

Then, I need to store each file in a folder which filename corresponds to sub-m*. for instance (example sub-m0001_subfile.txt) see:

/new/path/sub-m0001/sub-m0001_subfile.txt

/new/path/sub-m0002/sub-m0002_subfile.txt

Many thanks in advance

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

Answer 1

Stephen23 2025 年 1 月 29 日 18:49

0
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/2173447-create-multiple-subtables-from-multiple-tsv-tables#answer_1558536

編集済み: Stephen23 2025 年 1 月 30 日 4:38

MATLAB Online で開く

"The following code does it only for the first .tsv file. Any hint to go recursively over the 120 .tsv files? "

There is nothing in your code that in any way stores or saves the data from each iteration, so your code iterates through each file, imports the file data, and then discards/overwrites the file data on the next loop iteration. So in the end it might look as if it only imported data from one file. But looking at the value of files_i would tell you how many files it has iterated over.

Solution: either use indexing to allocate the imported data into one array (e.g. a cell array or structure array) or export the data into files on each iteration.

"I need to store each file in a folder which filename corresponds to sub-m*. for instance (example sub-m0001_subfile.txt) "

Then you need to export the table data. For example:

V = {'trans_x','trans_y','trans_z','rot_x','rot_y','rot_z'};
S = dir('/f1/f2/f3/f4/f5/f6/f7/f8/sub-m*/f10/f11/*file.tsv');
for k = 1:numel(S)
    % import file data:
    F = fullfile(S(k).folder,S(k).name); 
    T = readtable(F,"FileType","text",'Delimiter', '\t');
    % optional: store imported filedata:
    S(k).data = T;
    % export table data:
    G = extractBefore(S(k).name,'_');
    H = fullfile('/new/path',G,[G,'_subfile.txt']);
    U = T(:,V);
    writetable(U,H)
end

The data is all stored in the structure S. You can access this using indexing, e.g. the 2nd file:

S(2).folder % location
S(2).name % filename
S(2).data % imported file data

6 件のコメント
4 件の古いコメントを表示4 件の古いコメントを非表示

julian gaviria 2025 年 1 月 30 日 14:13

編集済み: julian gaviria 2025 年 1 月 30 日 14:30

MATLAB Online で開く

Thanks so much @Stephen23

I struggle with importing the multiple output .txt files.

V = {'trans_x','trans_y','trans_z','rot_x','rot_y','rot_z'};
S = dir('/f1/f2/f3/f4/f5/f6/f7/f8/sub-m*/f10/f11/file.tsv');
for k = 1:numel(S) %numel returns the number of elements in array(e.g. struct)S 
    % import file data:
    F = fullfile(S(k).folder,S(k).name); 
    T = readtable(F,"FileType","text",'Delimiter', '\t');
    % optional: store imported filedata:
    S(k).data = T;
    % export table data:
    G = extractBefore(S(k).name,'.tsv'); 
    E = extractBetween(S(k).folder,"f8/","/f10"); %Unlike "extractBefore","extractBetween" creates cell arrays 
    B  =horzcat(E{:});
    H = fullfile('/new/path/',B,[G,'.txt']);
  % H = fullfile('/new/path/',[G,'.txt']); %trial for option2
    U = T(:,V);
    writetable(U,H,'WriteVariableNames', 0)
end

To note, the file names are the same in the input and in the output. Also, I need to add the iterated 9th folder (sub-m*) in the output's path (see "E" variable above).

To recall:

Paths for input .tsv files

/f1/f2/f3/f4/f5/f6/f7/f8/sub-m0001/f10/f11/file.tsv

/f1/f2/f3/f4/f5/f6/f7/f8/sub-m0002/f10/f11/file.tsv

requested path for output files:

/new/path/sub-m0001/file.txt

/new/path/sub-m0002/file.txt

Option2; creating the multiple .txt files and exporting them to "/new/path/". Then, they can be furtherly moved to individual folders with "movefile" . Something like:

/new/path/sub-m0001_file.txt

/new/path/sub-m0002_file.txt

Stephen23 2025 年 1 月 30 日 19:36

編集済み: Stephen23 2025 年 1 月 30 日 20:27

MATLAB Online で開く

"Issue 2: "E" does not work properly"

Please show the exact path and code that you used. It works as expected here:

P='./f1/f2/f3/f4/f5/f6/f7/f8/sub-m0001/f10/f11'; mkdir(P); dlmwrite(fullfile(P,'file.tsv'),1)
P='./f1/f2/f3/f4/f5/f6/f7/f8/sub-m0002/f10/f11'; mkdir(P); dlmwrite(fullfile(P,'file.tsv'),2)
S = dir('./f1/f2/f3/f4/f5/f6/f7/f8/sub-m*/f10/f11/file.tsv');
for k = 1:numel(S)
    E = regexprep(S(k).folder,{'^.*/f8/','/f10/.*$'},'')
end
E = 'sub-m0001'
E = 'sub-m0002'

Which means that you are doing something different to what you explained or showed, e.g. your folder names are not really f1, f2, etc. Guessing important information like this is much less reliable than it being written down.

In any case, here are alternative approaches that might work for your (duplicated?) folder names:

for k = 1:numel(S)
    E = regexprep(S(k).folder,{'^.*/f8/','/.*$'},'')
end
E = 'sub-m0001'
E = 'sub-m0002'
for k = 1:numel(S)
    E = regexp(S(k).folder,'sub-m\d+','match','once')
end
E = 'sub-m0001'
E = 'sub-m0002'

julian gaviria 2025 年 1 月 30 日 23:10

移動済み: Voss 2025 年 1 月 30 日 23:14

MATLAB Online で開く

Dear @Stephen23

issue1: "I get the following error because, in deed, the output (destination) file does not exist, it must be created."

You were right, the problem was the incomplete filename in the anchor expression indicating the end of the input text. E.g.,:

incorrect

E = regexprep(S(k).folder,{'^.*/f8/','/f10/.*$'},'')

correct

E = regexprep(S(k).folder,{'^.*/f8/','/f101a/.*$'},'')

Issue 2: "Issue 2: "E" does not work properly"

thanks a lot for the input. "mkdir" was the solution

H = fullfile('/new/path',E);
mkdir(H)
N = fullfile(H,[G,'.txt']);
U = T(:,V);
writetable(U,N, 'WriteVariableNames',0)

サインインしてコメントする。

Create multiple subtables from multiple .tsv tables

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

6 件のコメント
4 件の古いコメントを表示4 件の古いコメントを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

Create multiple subtables from multiple .tsv tables

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

6 件のコメント 4 件の古いコメントを表示4 件の古いコメントを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

6 件のコメント
4 件の古いコメントを表示4 件の古いコメントを非表示