urlread-function is very slow since R2012b - why?

Hi,
I'm using a small piece of code to read out information from about 100 websites by using urlread in its easiest form:
for count = 1:elements
html_str = urlread(['http://www.website.com/page' num2str(count)]);
end
Yesterday I changed my Matlab-version from R2011a to R2012b and was slightly shocked. With the old version the loop took about 20-30 seconds. The new one needs nearly 5 Minutes!
My research quarry that Mathworks has expanded the urlread-function with 3 additional parameters and took some old ones away. http://www.mathworks.de/de/help/matlab/release-notes.html But I do not use any of these parameters, only the naked function. I tried the manual reduction of the 'Timeout' to 0.1, but with no influence.
Are there any ideas, what causes the speed-loss and first of all how I can get my old performance back? Thanks a lot for your assistance!

 採用された回答

Jan
Jan 2012 年 11 月 11 日

0 投票

Why do you assume that the problem is caused by Matlab? Do you have the old Matlab still installed? If so, please check if the old version is slower today also.

5 件のコメント

Sven Meister
Sven Meister 2012 年 11 月 11 日
Hm, no, I installed Windows 8 and by the way the newest Matlab Version. But I can't imagene which influence Win8 should have on the urlread-function. My old system was also 64-bit on the same hardware. After I saw, that they had changed the urlread-function, I assumed it was caused by the new Matlab version.
Do you have an idea? My old Matlab version was killed by the new Windows.
Thanks!
Jan
Jan 2012 年 11 月 11 日
編集済み: Jan 2012 年 11 月 12 日
Oh, you have installed Windows 8 at the same time. This is a very important information, which should appear in the original question.
The network traffic is checked by virus scanners. The TCP/IP-connection is influenced by different drivers. Perhaps Windows 8 does not use a proper packet size for your ethernet card or runs your WLAN with the wrong speed, etc.
Jan
Jan 2012 年 11 月 12 日
Please avoid cross posting. It wastes the time of the ones who voluntarily try to help. Thanks!
Andreas Goser
Andreas Goser 2012 年 11 月 12 日
Aha - I found the crossposting Jan mentioned on gomatlab.de :-)
Andreas Goser
Andreas Goser 2012 年 11 月 13 日
Looks like you found a solution, can you share what helped? And I found Wni8 IS supported. Seems to be back qualiified: http://www.mathworks.com/support/sysreq/current_release/

サインインしてコメントする。

その他の回答 (2 件)

Clemens
Clemens 2013 年 11 月 28 日

1 投票

1 件のコメント

James
James 2014 年 1 月 13 日
Solved my problem too - on testing, urlread2 is around 10 times faster than urlread.

サインインしてコメントする。

John Hedengren
John Hedengren 2013 年 11 月 8 日

0 投票

I also noticed this loss in speed that Sven mentioned on some of our lab computers. We were running 2010b and 2012b. The same code would take 2-3 seconds on 2010b and over 5 minutes on 2012b so we naturally used 2010b. We use a web-service architecture in the APMonitor Optimization toolbox so some users switched to APM Python because of the performance decrease. I recently tried a modified version of urlread with just the basic function calls (see below). It seems to hang up when it tries to get a response from the server.
inputStream = urlConnect.getInputStream;
Any suggestions would be greatly appreciated. The code that we are using is online at APMonitor.com. Running test_all.m should run a number of example problems. For some users it is fast but for others it is extremely slow.
function output = urlread_apm(str)
try
if usejava('jvm'),
import com.mathworks.mlwidgets.io.InterruptibleStreamCopier;
% use Java if available
url = java.net.URL(str);
urlConnect = url.openConnection;
timeout = 1000000000;
method = 'GET';
urlConnect.setRequestMethod(upper(method));
urlConnect.setReadTimeout(timeout);
urlConnect.setDoOutput(true);
outputStream = urlConnect.getOutputStream;
outputStream.close;
try
inputStream = urlConnect.getInputStream;
isGood = true;
catch ME
inputStream = urlConnect.getErrorStream;
isGood = false;
if isempty(inputStream)
msg = ME.message;
I = strfind(msg,char([13 10 9]));
if ~isempty(I)
msg = msg(1:I(1)-1);
end
error(msg)
end
end
byteArrayOutputStream = java.io.ByteArrayOutputStream;
isc = InterruptibleStreamCopier.getInterruptibleStreamCopier;
isc.copyStream(inputStream,byteArrayOutputStream);
inputStream.close;
byteArrayOutputStream.close;
output = char(typecast(byteArrayOutputStream.toByteArray','uint8'));
% output = typecast(byteArrayOutputStream.toByteArray','uint8');
else
% urlread is sometimes slower after R2012a
% use only if Java is not available
output = urlread(url);
end
catch Err
output = urlread(url);
end
end

カテゴリ

ヘルプ センター および File ExchangeStartup and Shutdown についてさらに検索

製品

タグ

質問済み:

2012 年 11 月 11 日

コメント済み:

2014 年 1 月 13 日

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by