Remove duplicate rows in CSV file
    8 ビュー (過去 30 日間)
  
       古いコメントを表示
    
    mohammad Alsajri
 2019 年 7 月 23 日
  
    
    
    
    
    コメント済み: mohammad Alsajri
 2019 年 7 月 25 日
            hello dear mathworkers, 
I have a dataset consist of approximatlly 4 millions records, and i want to remove the duplicated rows or records, can any one help me with the way, i am using matlab 2018a . thanks in advance 
7 件のコメント
採用された回答
  Alex Mcaulley
      
 2019 年 7 月 23 日
        Since all is numeric data, you can use:
data = xlsread('kdd.xlsx');
datanew = unique(data,'rows');
2 件のコメント
  Shameer Parmar
      
 2019 年 7 月 23 日
				This is not working, because non of data is similar.. I dont find duplicate entries in this sheet provided by Mohammad Alsajri.. 
using your command, the 'data' and 'datanew' both are getting exact same..
  Alex Mcaulley
      
 2019 年 7 月 23 日
				This code works! 
I guess the excel provided by Mohammad is just a small portion of the dataset (4 million of rows).
その他の回答 (0 件)
参考
カテゴリ
				Help Center および File Exchange で Web Services についてさらに検索
			
	Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!




