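Besides ordinary keyword search, most search engines also offer restricted (field-limited) searching. These features vary from engine to engine: some are set on the advanced search page, while others require special search operators. The operators of the various engines are broadly similar in function and form; what deserves attention is the function and usage of operators that one engine supports and another lacks. It is best to become familiar with Google's special search operators first (many introductions are already available in Chinese), and then use the material below for further comparison.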
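To make the "one engine has it, another lacks it" point concrete, the sketch below tabulates a few of the prefixes listed in the sections that follow. The grouping into four generic fields (title, url, site, link) and the helper names `translate` and `engines_lacking` are assumptions made for illustration; they are not part of any engine's interface, and `None` only means that this document lists no close equivalent.

```python
# Prefixes copied from the operator lists in this document.
# None means the document lists no close equivalent for that engine.
# The four generic field names are an illustrative grouping, not an official one.
FIELD_PREFIXES = {
    "Google":    {"title": "intitle:", "url": "inurl:",  "site": "site:",   "link": "link:"},
    "AlltheWeb": {"title": "title:",   "url": "url:",    "site": "site:",   "link": "link:"},
    "AltaVista": {"title": "title:",   "url": "url:",    "site": "host:",   "link": "link:"},
    "HotBot":    {"title": "title:",   "url": None,      "site": "domain:", "link": "linkdomain:"},
    "Teoma":     {"title": "intitle:", "url": "inurl:",  "site": "site:",   "link": None},
    "Gigablast": {"title": "title:",   "url": "suburl:", "site": "site:",   "link": "link:"},
}


def translate(field, term, engine):
    """Return term with the engine's prefix for a generic field, or None if not listed."""
    prefix = FIELD_PREFIXES[engine][field]
    if prefix is None:
        return None
    return prefix + term


def engines_lacking(field):
    """List the engines for which this document gives no prefix for the field."""
    return [engine for engine, prefixes in FIELD_PREFIXES.items()
            if prefixes[field] is None]


if __name__ == "__main__":
    for engine in FIELD_PREFIXES:
        print(engine, translate("site", "notess.com", engine))
    print("No URL prefix listed for:", engines_lacking("url"))
```

The prefixes do not always behave identically (for example, HotBot's linkdomain: limits by domain rather than by exact URL), so the table is a starting point for comparison rather than a strict equivalence.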
Google
intitle: Finds pages that have the term(s) in the HTML title element. Can be
combined with other search terms. intitle:search engines. This should find
'search' in the title and 'engines' anywhere in the page.
inurl: Finds pages that have the term(s) somewhere in the URL (host name,
path, or filename). Can be combined with other search terms.
inurl:searchenginewatch.
allintitle: Finds pages that have all of the terms in the HTML title element.
allintitle:search engines.
link: Finds pages which contain hypertext links to the exact specified URL.
link:notess.com/search finds pages with links to this site.
allinurl: Finds pages that have all of the terms somewhere in the URL (host
name, path, or filename). allinurl:searchenginewatch.
site: Finds pages from the designated Web site. Paths and file names cannot
be included. An additional search term must be used. Try a term from the
domain name for the most comprehensive results. notess site:notess.com finds
how many pages Google has indexed or listed.
allinanchor: Finds pages that have the term(s) somewhere in the links to the
page.
related: Invokes GoogleScout to find other pages similar in linkage patterns
to the given URL and at a similar hierarchical level. The URL must be exact.
In other words, related:notess.com and related:www.notess.com find different
results.
flink: Used to find pages linked from the given URL.
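As a rough illustration of how the operators above are combined with ordinary terms, the following sketch joins query parts with spaces and URL-encodes them. The function name `google_search_url` is hypothetical, and the sketch assumes the usual `https://www.google.com/search?q=` query form; it only builds a URL and does not fetch anything.

```python
from urllib.parse import urlencode


def google_search_url(*parts):
    """Join operator expressions and plain terms into one search URL."""
    query = " ".join(parts)
    # urlencode escapes the colon and turns spaces into '+'.
    return "https://www.google.com/search?" + urlencode({"q": query})


if __name__ == "__main__":
    # 'search' must be in the title, 'engines' may be anywhere in the page.
    print(google_search_url("intitle:search", "engines"))
    # Both words must be in the title.
    print(google_search_url("allintitle:search engines"))
    # site: needs an additional search term, as noted above.
    print(google_search_url("notess", "site:notess.com"))
```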
********
AlltheWeb
url: (advanced search equivalent: in the URL) Pages have the term(s)
somewhere in the URL (host name, path, or filename).
link: or link.all: (advanced search equivalent: in the link to URL) Pages
that link to this URL or portion of a URL.
title: or normal.title: (advanced search equivalent: in the title) Hits have
the term(s) in the HTML title element.
site: (advanced search equivalent: none, under domain filters instead) A
better, more exact match for the domain name. Introduced in Sept. 2002, the
site: command is shorter to type, more common at other search engines, and
more of an exact match. For example, site:www.total.com finds different
results than site:www.total.com.au.
The site: command can be used with two additional operators, the caret ^ and
the asterisk *. The ^ anchors the domain while the * unanchors. In other
words, site:^total.com will not match either www.total.com or total.com.au,
and site:*total.com* will match total.com, www.total.com, and total.com.au.
The * and ^ can be used within the same query, and the default is to have
the end anchored but not the beginning, as in site:*total.com^.
url.all: (advanced search equivalent: in the URL) Same as url: above. Pages
have the term(s) somewhere in the URL (host name, path, or filename).
url.domain: (advanced search equivalent: in the host name) Pages with the
specified term anywhere in the domain name.
url.tld: (advanced search equivalent: none) Pages within the specified top
level domain.
url.host: (advanced search equivalent: none) Pages with the specified host
name.
normal.titlehead: (advanced search equivalent: none) Hits have the term(s) in
the HTML title element or elsewhere within a HEADER tag.
link.extension: (advanced search equivalent: none) Matches pages that contain
files with the specified extension.
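The anchoring rules given for the site: command in this section can be modelled in a few lines. This is only a sketch of the behaviour as described here: the function name `site_matches` is hypothetical, and it assumes plain string comparison on the host name, which is a simplification (for instance, it would also let total.com match a host such as nottotal.com).

```python
def site_matches(pattern, host):
    """Model the ^ / * anchoring described for the site: command.

    Defaults follow the text: the end is anchored, the beginning is not.
    A leading or trailing ^ anchors that side; a * unanchors it.
    """
    start_anchored, end_anchored = False, True
    if pattern.startswith(("^", "*")):
        start_anchored = pattern[0] == "^"
        pattern = pattern[1:]
    if pattern.endswith(("^", "*")):
        end_anchored = pattern[-1] == "^"
        pattern = pattern[:-1]

    core, host = pattern.lower(), host.lower()
    if start_anchored and end_anchored:
        return host == core
    if start_anchored:
        return host.startswith(core)
    if end_anchored:
        return host.endswith(core)
    return core in host


if __name__ == "__main__":
    hosts = ["total.com", "www.total.com", "total.com.au"]
    for pattern in ["^total.com", "*total.com*", "total.com"]:
        matched = [h for h in hosts if site_matches(pattern, h)]
        print(pattern, "->", matched)
```

Under these assumptions the plain form total.com behaves the same as *total.com^, which is the default the text describes.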
******
AltaVista
anchor: Term(s) located in the text of a hyperlink. anchor:"search engine
showdown"
applet: Pages containing a Java applet with the term in the name.
applet:morph
domain: For top-level domain only. domain:edu
host: For a particular site. host:notess.com
image: Pages have an image with term in filename. image:gull finds pages
with gull.gif
link: Hypertext links include the term(s). link:notess.com finds pages with
links to this site.
text: Pages include the term(s) somewhere other than in an image tag, link,
or URL.
title: Hits have the term(s) in the HTML title element. title:"search
engines"
url: Pages have the term(s) somewhere in the URL (host name, path, or
filename). url:searchenginewatch
like: Find similar pages to the submitted URL. Requires a complete URL,
although the http:// can be omitted. Works in Simple Search, Advanced Search
Sort by box, but not Advanced Search Boolean box. Is the same function as
clicking on the Related pages link in the display. It cannot be combined
with other search terms. like:notess.com
********
HotBot
title: Hits have the term in the HTML title element. Only a single word can
be searched this way. Use the drop down option to search multiple title
words. title:showdown
domain: For domains up to three levels deep. domain:dept.stateu.edu
depth:[number] Number of subdirectories deep in a Web site. depth:3
linkdomain: Limits to pages containing links to the specified domain.
linkdomain:notess.com
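The depth: limit above counts subdirectories, and one plausible way to picture that count is sketched below. How HotBot actually counted is not documented here, so treat this as an assumption: the hypothetical `directory_depth` function counts the directory segments in a URL path and treats the last segment as a file name unless the path ends in a slash. The example.com addresses are placeholders.

```python
from urllib.parse import urlsplit


def directory_depth(url):
    """Count directory segments in the URL path (assumed meaning of 'depth')."""
    path = urlsplit(url).path
    # Drop the leading empty piece before the first '/' and the final piece,
    # which is a file name (or empty when the path ends in a slash).
    segments = path.split("/")[1:-1]
    return len([s for s in segments if s])


if __name__ == "__main__":
    for url in [
        "http://www.example.com/",
        "http://www.example.com/a/index.html",
        "http://www.example.com/a/b/c/page.html",
    ]:
        print(directory_depth(url), url)
```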
*******
Teoma
intitle: Finds pages that have the term(s) in the HTML title element. If one
intitle: is used, all search terms are searched within the title element.
Can be combined with a phrase search. intitle:"search engines".
inurl: Finds pages that have the term(s) somewhere in the URL (host name,
path, or filename). If one inurl: is used, all search terms are searched
within the URL. Can be combined with a phrase search.
inurl:searchenginewatch.
site: Finds pages from the designated Web site. This is really a domain
limit. Top level domain must be included. Paths and file names cannot be
included. An additional search term must be used. Try a term from the domain
name for the most comprehensive results. A search like notess
site:notess.com finds how many pages are included for a specific site.
inlink: Is supposed to find pages that have the term(s) somewhere in the
anchor text, but this does not yet appear to work properly.
********
Gigablast
ip: Page is in the specified IP range. Incomplete numbers are truncated.
ip:216.32.120 finds any computer in 216.32.120.*
link: Pages include a link to the specified URL.
link:searchengineshowdown.com finds pages with links to this site.
site: Results are only from the specified site. site:nasa.gov finds pages at
NASA's Web site
suburl: Pages have the term(s) somewhere in the URL (host name, path, or
filename). suburl:searchenginewatch
title: Hits have the term(s) in the HTML title element. title:"search
engines"
type: File type. Options as of Aug. 2003 are
type:pdf for Adobe Acrobat PDFs
type:doc for Microsoft Word documents
type:ppt for PowerPoint presentations
type:xls for Excel spreadsheets
type:ps for PostScript files
type:text for ASCII text files
type:html for HTML Web pages
url: Result must be exactly this URL and nothing else. url:www.slashdot.com/index.html
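The ip: entry in this section says that an incomplete address such as 216.32.120 covers every machine in 216.32.120.*. A minimal sketch of that prefix rule follows; the function name `ip_matches` is hypothetical, and comparing whole dotted components (rather than raw characters) is an assumption about how the truncation works.

```python
def ip_matches(prefix, address):
    """Return True if address falls under the (possibly incomplete) IP prefix."""
    prefix_parts = prefix.rstrip(".").split(".")
    address_parts = address.split(".")
    # Compare component by component so that 216.32.12 does not match 216.32.120.x.
    return address_parts[:len(prefix_parts)] == prefix_parts


if __name__ == "__main__":
    for address in ["216.32.120.7", "216.32.12.7", "216.33.120.7"]:
        print(address, ip_matches("216.32.120", address))
```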