A web crawler (tseem hu ua web kab laug sab, kab laug sab bot, web bot, lossis yooj yim crawler) yog khoos phis tawj software uas yog siv los ntawm lub tshuab tshawb nrhiav rau index nplooj ntawv web thiab cov ntsiab lus thoob plaws ntiaj teb Wide Web. … Kev tshawb nrhiav indexing tuaj yeem muab piv rau phau ntawv indexing.
crawler hauv ICT yog dab tsi?
A web crawler (tseem hu ua web kab laug sab lossis web neeg hlau) yog a program lossis automated script which browses the World Wide Web in a methodical, automated types. Cov txheej txheem no hu ua Web crawling los yog kab laug sab. Ntau qhov chaw raug cai, tshwj xeeb hauv kev tshawb fawb xyaw, siv kab laug sab los ua ib qho kev muab cov ntaub ntawv tshiab.
web crawler siv rau dab tsi?
Nrhiav cov ntaub ntawv los ntawm kev nkag mus
Peb siv software hu ua web crawlers txhawm rau tshawb pom cov nplooj ntawv uas muaj nyob hauv lub vev xaib. Crawlers saib cov nplooj ntawv web thiab ua raws cov kev sib txuas ntawm cov nplooj ntawv, zoo li koj xav tau yog tias koj tab tom nrhiav cov ntsiab lus hauv lub vev xaib. Lawv mus ntawm qhov txuas mus txuas thiab nqa cov ntaub ntawv hais txog cov nplooj ntawv web rov qab mus rau Google cov servers.
Tus neeg sawv cev hom twg yog lub vev xaib crawler?
A Web crawler yog ib qho hom bot, lossis software tus neeg sawv cev. Feem ntau, nws pib nrog cov npe ntawm URLs mus xyuas, hu ua cov noob. Raws li tus neeg nkag mus saib cov URLs no, nws txheeb xyuas tag nrho cov hyperlinks hauv nplooj ntawv thiab ntxiv rau lawv rau cov npe URLs mus ntsib, hu ua tus nkag nkag mus.
nkag mus piav qhia meej yog dab tsi?
Crawling yog thaum Google lossis lwm lub tshuab tshawb nrhiav xaib bot mus rau nplooj ntawv web lossis lub vev xaib thiab "nyeem" nplooj ntawv. … Kev nkag mus yog thawj feem ntawm kev muaj lub tshuab tshawb nrhiav pom koj nplooj ntawv thiab qhia nws hauv cov txiaj ntsig tshawb. Muaj koj nplooj ntawv nkag mus, txawm li cas los xij, tsis tas txhais tau tias koj nplooj ntawv yog (lossis yuav) indexed.