W3Cschool
恭喜您成為首批注冊(cè)用戶(hù)
獲得88經(jīng)驗(yàn)值獎(jiǎng)勵(lì)
簡(jiǎn)單的爬蟲(chóng)案例: 將百度的首頁(yè)爬取下來(lái)并保存在文件中。
import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.OutputStreamWriter;
import java.io.UnsupportedEncodingException;
import java.net.URL;
public class Test {
public static void main(String[] args) throws UnsupportedEncodingException, IOException {
URL url = new URL("http://www.baidu.com");
BufferedReader bReader =
new BufferedReader(new InputStreamReader(url.openStream(), "utf-8"));
BufferedWriter bWriter =
new BufferedWriter(new OutputStreamWriter(new FileOutputStream("baidu.html")));
String msg = null;
while((msg = bReader.readLine()) != null) {
// System.out.println(msg);
bWriter.append(msg + "\n");
}
bWriter.close();
bReader.close();
}
}
Copyright©2021 w3cschool編程獅|閩ICP備15016281號(hào)-3|閩公網(wǎng)安備35020302033924號(hào)
違法和不良信息舉報(bào)電話(huà):173-0602-2364|舉報(bào)郵箱:jubao@eeedong.com
掃描二維碼
下載編程獅App
編程獅公眾號(hào)
聯(lián)系方式:
更多建議: