How to process a csv file where numbers and text are at the same column?

Nivodi on 22 Jul 2018

Direct link to this question:

https://jp.mathworks.com/matlabcentral/answers/411576-how-to-process-a-csv-file-where-numbers-and-text-are-at-the-same-column

Commented: Nivodi on 5 Sep 2018

Accepted Answer: Image Analyst

test2_seq.csv

Hello, everyone. I have the attached CSV file, and there are a few things I need to do:

create a table,
read every text or number that appears after the colon (:),
read all rows from 1 to 37, and
put the variable names in the first row and the corresponding text and numbers in the second row, giving a 2×37 table.

I am kindly asking for help because I have 7400 files in total to process! Thank you in advance.

2 Comments

Walter Roberson on 22 Jul 2018

That appears to be the 1991 MSA/MAS spectral file format, as described at https://www.microscopy.org/resources/scientific_data/HMSAFileFormat-Presentation_2012_small.pdf

Can you output as .hmsa instead? If so, then I see https://github.com/pyhmsa/pyhmsa-matlab which appears to be Python code to convert .hmsa into MATLAB .mat files.

Nivodi on 23 Jul 2018

No, unfortunately, I can't.

Accepted Answer

Image Analyst on 22 Jul 2018

Direct link to this answer:

https://jp.mathworks.com/matlabcentral/answers/411576-how-to-process-a-csv-file-where-numbers-and-text-are-at-the-same-column#answer_329845

That's not a CSV file. You might have to write a custom reader for it. Here's a start:

fullFileName = fullfile(pwd, 'test2_seq.csv')
% Open the file.
fileID = fopen(fullFileName, 'rt');
% Read the first line of the file.
textLine = fgetl(fileID);
while ischar(textLine)
  % Echo the current line to the command window.
  fprintf('%s\n', textLine);
  if contains(textLine, '#DATE', 'IgnoreCase', true)
    colonLocation = strfind(textLine, ':');
    semicolonLocation = strfind(textLine, ';');
    % Keep the text between the first colon and the first semicolon, excluding both.
    DATE = strtrim(textLine(colonLocation(1) + 1 : semicolonLocation(1) - 1));
  elseif contains(textLine, '#TIME', 'IgnoreCase', true)
  elseif contains(textLine, '#OWNER', 'IgnoreCase', true)
  elseif contains(textLine, '#NPOINTS', 'IgnoreCase', true)
  elseif contains(textLine, '#NCOLUMNS', 'IgnoreCase', true)
  elseif contains(textLine, '#XUNITS', 'IgnoreCase', true)
  elseif contains(textLine, '#YUNITS', 'IgnoreCase', true)
  elseif contains(textLine, '#DATATYPE', 'IgnoreCase', true)
  elseif contains(textLine, '#XPERCHAN', 'IgnoreCase', true)
  elseif contains(textLine, '#OFFSET', 'IgnoreCase', true)
  elseif contains(textLine, '#LIVETIME', 'IgnoreCase', true)
  elseif contains(textLine, '#SIGNALTYPE', 'IgnoreCase', true)
    % etc.
  end
  % Read the next line.
  textLine = fgetl(fileID);
end
% All done reading all lines, so close the file.
fclose(fileID);
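
As one possible way to fill in the empty branches above without writing a separate branch per keyword, here is a minimal sketch that stores whatever follows the first colon under the keyword's own name. The sample lines are invented, and the trailing semicolons are only an assumption about how the file was exported, so adjust it to match the real file.

```matlab
% Made-up header lines standing in for lines read with fgetl.
sampleLines = {'#DATE        : 22-JUL-2018;', ...
               '#TIME        : 12:30;', ...
               '#NPOINTS     : 1024;'};
header = struct();
for k = 1 : numel(sampleLines)
  textLine = sampleLines{k};
  colonLocation = strfind(textLine, ':');
  if isempty(colonLocation) || textLine(1) ~= '#'
    continue   % Not a "#KEY : value" line, so skip it.
  end
  % Keyword is everything between the leading # and the first colon.
  key = strtrim(textLine(2 : colonLocation(1) - 1));
  % Value is everything after the first colon (later colons, as in a time, are kept).
  value = strtrim(textLine(colonLocation(1) + 1 : end));
  % Assumption: trailing semicolons are separators, not part of the value.
  value = regexprep(value, ';+\s*$', '');
  % makeValidName guards against keywords that are not legal field names.
  header.(matlab.lang.makeValidName(key)) = value;
end
disp(header)
```

Every value is stored as text here; numeric fields such as the point count would still need str2double afterwards.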

15 Comments (13 older comments not shown)

Walter Roberson on 25 Jul 2018

You can zip the file and attach the zip.

Nivodi on 25 Jul 2018

Seq1.zip

Thank you, Walter! :))

More Answers (1)

Walter Roberson on 26 Jul 2018

Direct link to this answer:

https://jp.mathworks.com/matlabcentral/answers/411576-how-to-process-a-csv-file-where-numbers-and-text-are-at-the-same-column#answer_330387

projectdir = '.';    %directory the files are in
dinfo = dir( fullfile(projectdir, '*.xsp') );
filenames = fullfile(projectdir, {dinfo.name});
nfiles = length(filenames);
table_so_to_speak = cell(nfiles, 1);
for K = 1 : nfiles
  S = fileread( filenames{K} );
  parts = regexp(S, '^#?(?<field>\w+)\s*: *(?<val>.*?)\r?$', 'names', 'lineanchors', 'dotexceptnewline');
  table_so_to_speak{K} = [{parts.field}; {parts.val}];
end

However, I suspect you want something more like this, with one header row followed by one row per file:

projectdir = '.';    %directory the files are in
dinfo = dir( fullfile(projectdir, '*.xsp') );
filenames = fullfile(projectdir, {dinfo.name});
nfiles = length(filenames);
table_so_to_speak = cell(nfiles+1, 1);
for K = 1 : nfiles
  thisfile = filenames{K};
  S = fileread( thisfile );
  [~, basename] = fileparts(thisfile);
  parts = regexp(S, '^#?(?<field>\w+)\s*: *(?<val>.*?)\r?$', 'names', 'lineanchors', 'dotexceptnewline');
  if K == 1
    table_so_to_speak(1,1:length(parts)+1) = [{'filename'}, {parts.field}];
  end
  table_so_to_speak(K+1,:) = [{basename}, {parts.val}];
end
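
Since the question asks for a table, the cell array built above could probably be turned into a MATLAB table with cell2table. The sketch below uses a small invented cell array in place of the real one (the field names and values are made up), and it assumes the header names are unique once they have been made valid.

```matlab
% Stand-in for the cell array built above: a header row, then one row per file.
% The field names and values are invented for illustration.
table_so_to_speak = { ...
  'filename', 'DATE',        'NPOINTS'; ...
  'Seq1',     '22-JUL-2018', '1024';    ...
  'Seq2',     '23-JUL-2018', '2048'};
% Header row becomes the variable names; makeValidName fixes illegal names.
varNames = matlab.lang.makeValidName(table_so_to_speak(1, :));
T = cell2table(table_so_to_speak(2:end, :), 'VariableNames', varNames);
% Everything was read as text, so convert numeric columns explicitly.
T.NPOINTS = str2double(T.NPOINTS);
disp(T)
```

This gives one row per file rather than a separate 2×37 table for each file, which may be easier to work with across 7400 files.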

12 Comments (10 older comments not shown)

Walter Roberson on 26 Jul 2018

The first few lines use dir() and fullfile to get a complete list of xsp filenames in the given directory.

After that you loop over each file.

You read an entire file at one go as a continuous string. Then you do some pattern matching on the string. The pattern matching looks for an optional # at the beginning of a line, followed by any combination of digits and letters and underscore (a "word"), followed by spaces, followed by colon, and stores the word in a structure field named "field". Then it continues looking on the line for spaces, followed by whatever is there, until the end of line that might be preceded by a carriage return, and it stores that "whatever is there" in a structure field named "val". It keeps doing that for the entire string that is the file contents, so it is building up a structure array of field names and corresponding values.

Once the structure array is found, if this is the first file then it stores the field names read in as the first row in a cell array; it does not bother to store the field names for files after that because it assumes they are the same in the same order as the first file. Regardless of whether it was the first file or not, it stores the filename and the field values as a row in the cell array.

This reading and text analysis and storing is done for all of the files.
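
To see the pattern matching in isolation, here is a small sketch that runs the same regular expression on a two-line string instead of a file. The two header lines are made up, and the carriage returns are included only to show why the pattern has the optional \r before the end of line.

```matlab
% Two made-up header lines with Windows-style (CR LF) line endings.
S = sprintf('#DATE        : 22-JUL-2018\r\n#NPOINTS     : 1024\r\n');
% Same pattern as in the answer: optional #, a word, a colon, then the rest of the line.
parts = regexp(S, '^#?(?<field>\w+)\s*: *(?<val>.*?)\r?$', ...
               'names', 'lineanchors', 'dotexceptnewline');
% parts is a struct array with one element per matched line.
for k = 1 : numel(parts)
  fprintf('%s -> %s\n', parts(k).field, parts(k).val);
end
% Stack the names over the values, as done for each file in the loop.
pairs = [{parts.field}; {parts.val}];
```

The last line builds the 2-by-N layout asked for in the question, with names in the first row and values in the second.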

Walter Roberson on 4 Sep 2018

See https://www.mathworks.com/matlabcentral/fileexchange/47434-natural-order-filename-sort for some flexible code for sorting filenames.
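
If the file names only differ by a single number, a simpler alternative to the File Exchange submission might be enough. This is a rough sketch that does not use that submission: it assumes hypothetical names such as Seq1.xsp, Seq2.xsp, Seq10.xsp, where the first run of digits is the sequence number.

```matlab
% Hypothetical file names; a plain character sort would put Seq10 before Seq2.
names = {'Seq1.xsp', 'Seq10.xsp', 'Seq2.xsp'};
% Pull out the first run of digits from each name.
numText = regexp(names, '\d+', 'match', 'once');
% Sort by numeric value instead of by characters.
[~, order] = sort(str2double(numText));
names = names(order);
disp(names)
```

Names with several numeric parts, or with no digits at all, would need the more flexible File Exchange code instead.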

Nivodi on 5 Sep 2018

Thank you very much both of you!!
