Remove duplicate rows in CSV file
mohammad Alsajri
23 Jul 2019
Commented: mohammad Alsajri
25 Jul 2019
Hello dear MathWorkers,
I have a dataset consisting of approximately 4 million records, and I want to remove the duplicate rows (records). Can anyone help me with how to do this? I am using MATLAB R2018a. Thanks in advance.
Accepted Answer
Alex Mcaulley
23 Jul 2019
Since all the data is numeric, you can use:
data = xlsread('kdd.xlsx');
datanew = unique(data,'rows');
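Note that unique(data,'rows') returns the rows in sorted order. If the original row order matters, the 'stable' option keeps the first occurrence of each row in place. The following is a minimal sketch on a small made-up matrix, not the actual KDD data. For the full 4-million-row file, the matrix would presumably be loaded from the CSV itself (for example with csvread or readtable, both available in R2018a), since a single .xlsx worksheet holds at most 1,048,576 rows.

```matlab
% Made-up example data (assumption): rows 3 and 5 repeat rows 1 and 2.
data = [1 2 3; 4 5 6; 1 2 3; 7 8 9; 4 5 6];

% 'stable' keeps the first occurrence of each row in its original order;
% firstIdx holds the row numbers of 'data' that were kept.
[datanew, firstIdx] = unique(data, 'rows', 'stable');

% Number of rows dropped as duplicates.
numRemoved = size(data, 1) - size(datanew, 1);
fprintf('Removed %d duplicate rows.\n', numRemoved);
```

Here datanew contains the rows [1 2 3], [4 5 6] and [7 8 9], in that order, and firstIdx can be used to look up which original records survived.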
2 Comments
Shameer Parmar
23 Jul 2019
This is not working, because none of the data is similar. I don't find any duplicate entries in the sheet provided by Mohammad Alsajri.
Using your command, 'data' and 'datanew' come out exactly the same.
Alex Mcaulley
23 Jul 2019
This code works!
I guess the Excel file provided by Mohammad is just a small portion of the dataset (4 million rows).
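One way to settle whether a given sheet contains duplicates at all is to count how often each distinct row occurs. The sketch below uses the third output of unique together with accumarray. The matrix is made-up illustration data, not the sheet discussed above.

```matlab
% Made-up example data (assumption): [1 2 3] occurs 3 times, [4 5 6] twice.
data = [1 2 3; 4 5 6; 1 2 3; 7 8 9; 4 5 6; 1 2 3];

% groupIdx maps every row of 'data' to its row in the sorted unique set.
[uniqueRows, ~, groupIdx] = unique(data, 'rows');

% Count the occurrences of each distinct row.
counts = accumarray(groupIdx, 1);

hasDuplicates  = any(counts > 1);           % true if any row repeats
duplicatedRows = uniqueRows(counts > 1, :); % the rows that repeat
```

If hasDuplicates is false, data and unique(data,'rows') have the same number of rows, which matches what was observed on the small sample sheet.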