Building Topic-Specific Search Engines : A Data Mining Approach
저자
발행사항
Los Angeles : University of California, 2000
학위논문사항
Thesis(doctoral)-- University of California: Computer Science 2000
발행연도
2000
작성언어
영어
주제어
KDC
569 판사항(4)
발행국(도시)
California
형태사항
xix, 153p. : Charts ; 26cm
일반주기명
References: p. 143-153
소장기관
Topic specific search engines are becoming popular with the phenomental growth of the World Wide Web. They have higher accuracy rate than general purpose search engines, and offer functions they cannot provide. But the topic-specific search engines available nowadays have very low cost-efficiency, because they require intensive human labor, and thus enormous cost, to upkeep as weell as to build. Efficient processing of the exploding information in the World Wide Web seems to call for smarter search engines, topic-specific search engines that require far less human labor while performing almost as well as those built and maintained by humans. This dissertation is a contribution towards meeting this demand. Building and maintaining topic-specific search engines with minimal human labor requires an automatic or semi-automatic informatino gathering system, the outputs of which can be fed to the search engines. In the dissertation, I discuss techniques for four major components of the requisite information gathering system:
(1) Domain information extraction
(2) Topic expansion
(3) Topic-driven information gathering
(4) Text-classification system for web documents
I also discuss the performance of the prototype system, a search engine for XML, that I built to test the techniques.
분석정보
서지정보 내보내기(Export)
닫기소장기관 정보
닫기권호소장정보
닫기오류접수
닫기오류 접수 확인
닫기음성서비스 신청
닫기음성서비스 신청 확인
닫기