Info

この質問は閉じられています。 編集または回答するには再度開いてください。

Database organization using algorithms

1 回表示 (過去 30 日間)
Lee
Lee 2017 年 4 月 12 日
閉鎖済み: MATLAB Answer Bot 2021 年 8 月 20 日
Hey everyone,
I'm not even sure if there's a way to accomplish this at all. I'm working on converting a bunch of Excel files into a database. Failry trivial overall but the largest hurdle i'm running into is that there are either multiple items that are the same thing but spelled incorrectly or a cell might have some extraneous information that should be split into a another cell. Again, it's fairly trivial to find and fix these things but it's becoming incredibly tedious even with built in Excel functions.
I use Matlab extensively for data analysis but now i'm curious if it can help me sort these excel list I have. My intuition says that it's likely more work than it's worth to properly set up an algorithm for each Workbook, since they aren't always in the same format. But then again, it's really just a string computation (possibly the Levenshtein distance algorithm?) and then replacing returned strings and choosing a master string to set all those as. It's probably a bit more complicated if there's a small change in the string (Example: Trim16 or Trim17).

回答 (1 件)

Sanjana Ramakrishnan
Sanjana Ramakrishnan 2017 年 4 月 18 日
You can use the implementation of Levenshtein and edit distance algorithms present in the file exchange for your requirement. Refer the below file exchange links:
https://www.mathworks.com/matlabcentral/fileexchange/17585-calculation-of-distance-between-strings
https://www.mathworks.com/matlabcentral/fileexchange/39049-edit-distance-algorithm

この質問は閉じられています。

タグ

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by