Merging table rows, keep all columns

Marc Elpel on 13 Nov 2019
Commented: Marc Elpel on 14 Nov 2019
I'm trying to combine data from multiple tables into one (data files attached). It seems like a simple join() or outerjoin(), but every approach has run into issues.
Specifically what I want to do:
  1. Add rows from table 2 to table 1.
  2. Keep all rows in both tables (append rows)
  3. Where column names match, use that column
  4. Where columns are new, add column to table width
  5. Keep column names (outer join is renaming based on source table)
  6. Some table values are empty and should combine as empty values in existing and/or new columns as needed.
Tried so far:
  1. join() - fails due to some empty values
  2. join() with NaN values replaced - fails due to some other key value error
  3. outerjoin() with multiple configuration options - all failed.
  4. innerjoin() - does not seem like what I want (throwing out data).
When done combining the attached tables, there should be slightly more columns than in the first table, and the number of rows should be the sum of the rows in both tables.
This should be a common issue, so I assume I am missing some simple solution...?
Using MATLAB R2016b
Marc
  6 Comments
Adam Danz on 13 Nov 2019
Edited: Adam Danz on 13 Nov 2019
I've read-in your tables and the column names match between both tables. Points 3 and 4 in your question (thanks for the numbering - that makes this easy to discuss) mention column names that do not match. Are there supposed to be column names that do not match?
I should add that upon reading in your table, Matlab had to modify some of the column names to conform to Matlab syntax.
Warning: Column headers from the file were modified to make them valid MATLAB identifiers before creating variable names for the table.
The original column headers are saved in the VariableDescriptions property.
Set 'PreserveVariableNames' to true to use the original column headers as table variable names.
files = {'RESULTS_SAMP1.CSV', 'RESULTS_SAMP2.CSV'}; %Full paths are always better
T1 = readtable(files{1},'Delimiter',',');
T2 = readtable(files{2},'Delimiter',',');
% Do column names match?
all(ismember(T1.Properties.VariableNames, T2.Properties.VariableNames)) % Yes
all(ismember(T2.Properties.VariableNames, T1.Properties.VariableNames)) % Yes
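When the two checks above do not both return true, it helps to see which variables differ. A minimal sketch using setdiff on the variable names; the toy tables A and B and their contents are hypothetical stand-ins, not the attached files:

```matlab
% Toy tables standing in for T1 and T2 (hypothetical data)
A = table([1;2], {'x';'y'}, 'VariableNames', {'ID','Test'});
B = table([3;4], [0.5;0.7], 'VariableNames', {'ID','Load'});

namesA = A.Properties.VariableNames;
namesB = B.Properties.VariableNames;

onlyInA = setdiff(namesA, namesB);    % variables that B is missing
onlyInB = setdiff(namesB, namesA);    % variables that A is missing
shared  = intersect(namesA, namesB);  % variables present in both tables

% The variable sets are identical only if both differences are empty
sameVars = isempty(onlyInA) && isempty(onlyInB);
```

Here onlyInA and onlyInB name the columns that would have to be added to the other table before the two can be stacked.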
Marc Elpel on 13 Nov 2019
Tried fixing names first with 'PreserveVariableNames', but this did not work. "No public property PreserveVariableNames exists for class matlab.io.text.DelimitedTextImportOptions." Lesser issue compared to others.
I randomly selected two files and they were giving me merging errors, so I thought those had different columns. Some of my data DOES include differences; we can simulate that by deleting the 3rd column in the first table and the 5th column in the second table. (It does not matter which we delete, just making them different.) What join command will combine these tables, keeping all rows and adding columns as needed to match the data? In some cases there will be missing columns, which should be stuffed with empty cells.
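One way to read "stuffing" the missing columns, independent of any join function, is to pad each table with the variables it lacks and then stack the two. The following is only a sketch under the assumption that every padded variable is numeric (so NaN is a sensible filler); the toy tables are hypothetical:

```matlab
% Toy tables (hypothetical); all variables are numeric to keep padding simple
A = table([1;2], [10;20], 'VariableNames', {'ID','Load'});
B = table([3;4;5], [0.1;0.2;0.3], 'VariableNames', {'ID','Strain'});

missingInA = setdiff(B.Properties.VariableNames, A.Properties.VariableNames);
missingInB = setdiff(A.Properties.VariableNames, B.Properties.VariableNames);

% Pad each table with NaN columns for the variables it lacks
for k = 1:numel(missingInA)
    A.(missingInA{k}) = nan(height(A), 1);
end
for k = 1:numel(missingInB)
    B.(missingInB{k}) = nan(height(B), 1);
end

% Put B's variables in the same order as A's, then stack the rows
B = B(:, A.Properties.VariableNames);
C = [A; B];
```

With this approach the row count of C is always the sum of the two inputs. Text columns would need a different filler, for example repmat({''}, height(A), 1), and the filler type has to match the type of the existing column in the other table.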


Accepted Answer

Adam Danz on 13 Nov 2019
Edited: Adam Danz on 13 Nov 2019
% Read in the data
files = {'RESULTS_SAMP1.CSV', 'RESULTS_SAMP2.CSV'}; %Full paths are always better
T1 = readtable(files{1},'Delimiter',',');
T2 = readtable(files{2},'Delimiter',',');
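% Simulate column mismatch
% (removevars requires R2018a or later; on older releases use T1.SpecimenType = [];)
T1 = removevars(T1,'SpecimenType'); % remove col 3
T2 = removevars(T2,'Test'); % remove col 5
% Outer join on all shared variables: keeps unmatched rows from both tables
% and the union of their columns
T3 = outerjoin(T1,T2,'MergeKeys', true)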
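To see what the outerjoin call above does to rows and columns, here is a small self-contained sketch on hypothetical toy tables (not the attached files). Note one caveat relative to the "keep all rows" requirement: rows whose key values match in both tables are merged into a single row, so the row count equals the sum of the inputs only when no key combination appears in both tables.

```matlab
% Toy tables (hypothetical). ID is the only shared variable,
% so outerjoin uses it as the key by default.
A = table([1;2], [10;20], 'VariableNames', {'ID','Load'});
B = table([2;3], [0.2;0.3], 'VariableNames', {'ID','Strain'});

% 'MergeKeys', true keeps a single ID column instead of ID_A and ID_B
C = outerjoin(A, B, 'MergeKeys', true);

% ID = 2 occurs in both tables, so C has 3 rows rather than 4;
% values with no counterpart in the other table are filled with NaN
nRows = height(C);
```

In the answer above, every shared variable acts as a key, so two rows merge only if they agree on all shared columns.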
  4 Comments
Adam Danz on 13 Nov 2019
Glad I could help out.
Just so I understand, the problem you're describing isn't with the merging of tables, it's with importing the tables. Is that correct?
Have you tried importing the tables without using the PreserveVariableNames flag?
Could you attach one of the files causing problems?
Marc Elpel on 14 Nov 2019
The problem is I need to sanitize the data for posting, and as soon as I make any change and save the file, it works. There is something hidden in the original CSV files which is corrupting the import. Unfortunately I cannot upload these files without modification.
I think I tried the PreserveVariableNames flag which was unknown in 2016b. Not using it now.
I'm going to close the thread - thanks for your help!
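A file that imports correctly after being re-saved may carry a byte order mark or other non-printing bytes. The thread never establishes what the hidden content was, so the following is only a sketch of how one might look for it: it reads the raw leading bytes of a file and flags anything outside printable ASCII. The temporary file and its contents are hypothetical and stand in for a real CSV.

```matlab
% Write a small CSV that starts with a UTF-8 byte order mark (BOM).
% The file and its contents are hypothetical stand-ins for the real data.
fname = [tempname '.csv'];
fid = fopen(fname, 'w');
fwrite(fid, [239 187 191 uint8(sprintf('ID,Load\n1,10\n'))], 'uint8');
fclose(fid);

% Read the first bytes back without any text decoding
fid = fopen(fname, 'r');
bytes = fread(fid, 64, '*uint8')';
fclose(fid);
delete(fname);

% A UTF-8 BOM is the byte sequence EF BB BF (239 187 191)
hasBOM = numel(bytes) >= 3 && isequal(bytes(1:3), uint8([239 187 191]));

% Flag bytes outside printable ASCII, ignoring tab, LF and CR
isOdd = (bytes < 32 | bytes > 126) & ~ismember(bytes, uint8([9 10 13]));
oddPositions = find(isOdd);
```

To inspect a real file, replace the write step with the path to that file; oddPositions then gives the byte offsets worth examining in a hex viewer.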
