Evolving strategies for web crawler

Küçük, Kamil

Files

944.pdf (798.42 KB)

Date

2009-05-06

Authors

Küçük, Kamil

Publisher

Işık Üniversitesi

Access Rights

info:eu-repo/semantics/openAccess
Attribution-NonCommercial-NoDerivs 3.0 United States

Abstract

With the rapid growth of the Internet and of Internet-based information, the Internet has become the largest publicly accessible data source in the world. Millions of new pieces of information become available every day, so finding the right information becomes harder. To get correct information, users rely on trusted web sites and search engines. Trusted web sites link to one another, so users can reach correct and relevant information. Search engines use crawlers to follow links between pages. The context available to such crawlers can guide the navigation of links with the goal of efficiently locating highly relevant target pages. The crawler takes seed pages from search engines and follows their links using multiple agents. After the first search, the results are inserted into a database and used as seed pages for another search. The aim is to access more reliable information, using more seed pages, in a short time.
Abstract (translated from Turkish): The rapid growth of the Internet and of Internet-based information has made the Internet the most widely used source in the world. As the Internet grows with millions of new pieces of information added every day, reaching information has also become harder. Trusted sites or search engines are used to reach correct information. Trusted pages on the Internet link to one another, enabling users to reach correct and scalable information. The links found on seed pages obtained from search engines were followed using multiple agents, and an attempt was made to scale them. After the first search, the results were saved to a database and used as seed pages for other searches. The aim is thus to reach previously accessed information in less time, to search over more pages, and to reach more reliable information.
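
The loop described in the abstract (seed pages, link following by several agents, results stored and reused as seeds) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the thesis's implementation: the in-memory link graph `TOY_WEB`, the helper names `fetch_links`, `crawl`, `save_results`, and `load_seeds`, the use of a thread pool as the "agents", and SQLite as the database are all hypothetical choices made here. The relevance guidance mentioned in the abstract is left out.

```python
import sqlite3
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory link graph standing in for the web (page -> outgoing links).
TOY_WEB = {
    "a": ["b", "c"],
    "b": ["d"],
    "c": ["d", "e"],
    "d": [],
    "e": ["f"],
    "f": [],
}


def fetch_links(page):
    """Stand-in for downloading a page and extracting its links."""
    return TOY_WEB.get(page, [])


def crawl(seeds, max_depth, workers=4):
    """Breadth-first crawl; each frontier is fetched by a pool of worker 'agents'."""
    visited = set(seeds)
    frontier = list(seeds)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for _ in range(max_depth):
            next_frontier = []
            # pool.map fetches the pages concurrently and yields results in order.
            for links in pool.map(fetch_links, frontier):
                for link in links:
                    if link not in visited:
                        visited.add(link)
                        next_frontier.append(link)
            frontier = next_frontier
            if not frontier:
                break
    return visited


def save_results(conn, pages):
    """Store crawled pages so that a later search can start from them."""
    conn.execute("CREATE TABLE IF NOT EXISTS pages (url TEXT PRIMARY KEY)")
    conn.executemany(
        "INSERT OR IGNORE INTO pages (url) VALUES (?)",
        [(p,) for p in sorted(pages)],
    )
    conn.commit()


def load_seeds(conn):
    """Read the stored pages back as seed pages for the next search."""
    return [row[0] for row in conn.execute("SELECT url FROM pages ORDER BY url")]


conn = sqlite3.connect(":memory:")
first = crawl(["a"], max_depth=1)              # first search from one seed
save_results(conn, first)
second = crawl(load_seeds(conn), max_depth=1)  # stored results become the new seeds
save_results(conn, second)
```

Because the second search starts from every page stored by the first, it reaches pages one link further out at the same crawl depth, which is the effect the abstract attributes to reusing results as seeds.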

Description

Text in English ; Abstract: English and Turkish
Includes bibliographical references (leaves 47-49)
viii, 49 leaves

Citation

Küçük, K., (2009). Evolving strategies for web crawler. İstanbul: Işık Üniversitesi Fen Bilimleri Enstitüsü.

URI

https://hdl.handle.net/11729/944

Collections

Lisansüstü Eğitim Enstitüsü Tez Koleksiyonu
