Glossary for Web Businesses and Website Design
The Internet is a world unto itself, created by geeks who delight in trying to outdo each other in creating obscure names for the technology. But fear not; we will reduce it to English here:
- ASP - Active Server Pages, (also Application Service Provider)
- Breadcrumbs - A navigation technique used in user interfaces that gives users a way of tracking their location within a website. If used, they tend to appear horizontally across the top of each webpage, below the main banner.
- CA - Certifying Authority. Authority which signs public key (PK), so some third party can verify, that the PK is really belongs to whom it claims to belong.
- CGI - Common Gateway Interface
- CORBA - Common Object Request Broker Architecture, it's distributed object computing infrastructure.
- DHTML - Dynamic HTML
- DOM - Document Object Model.
- DTD - Document Type Definition, this is the formal specification of a markup language, written using SGML
- HTML - HyperText Markup Language, if you don't know what this means, you are in the wrong place.
- HTTP - HyperText Transfer Protocol
- Java - Java programming language
- Java applet - Java application, which runs on the client side (inside the Web browser).
- JVM (Java VM) - Java Virtual machine.
- LDAP - Lightweight Directory Access Protocol, client-server protocol for accessing a directory service.
- NSAPI - Netscape API, API for writing plug-ins for Netscape browser.
- SGML - Standard Generalized Markup Language, a standard for describing markup languages
- SSI - Server Side Includes, technology, which allows to embed the simple server directives in the HTML pages
- SSL - Secure Sockets Layer, encrypted channel over which other protocols (like HTTP) can be used securely
- URL - Universal Resource Locator, the address of the resource on the Web. Look here for more info.
- URI - Uniform Resource Identifier. Basically the same as URL.
- VRML - Virtual Reality Markup language
- WYSIWYG - What You See Is What You Get
- XML - Extensible Markup Language
A named selection of data in a list, library, or content type, such as Title or Due Date. Columns are displayed vertically in most views and are displayed as fields in forms.
A value that is not calculated and, therefore, does not change. For example, the number 210, and the text "Quarterly Earnings" are constants. An expression, or a value resulting from an expression, is not a constant.
Content Placeholder Controls
Are indicated in Code view by <asp:ContentPlaceHolder>. They are regions of replaceable content on a master page.
A reusable group of settings for a category of content. Use content types to manage the metadata, templates, and behaviors of items and documents consistently. Content types are defined at the site level and used on lists and libraries.
A customized view of live data. You can use a Data View to view data from a wide variety of sources, including database queries, XML documents, Web services, SharePoint lists and libraries, and server-side scripts.
A prewritten formula that takes a value or values, performs an operation, and returns a value or values. Use functions to simplify and shorten formulas on a worksheet, especially those that perform lengthy or complex calculations.
A location on a SharePoint site where a collection of files is managed. The library can display information, including user-defined properties, about each file.
A Web site component that stores and displays information that users can add to by using their browsers. Requires a Web server that is running Windows SharePoint Services.
A numbered copy of a file that has changed significantly since the previous major version. Each major version is identified by a whole number (1, 2, 3, ...) to indicate that it is published for a wider group in your organization to view.
Master pages provide the look and feel that you want for all of the pages in your site. Master pages define the shared content — including the logo, team site name, shared navigation, and similar site elements — that is repeated from page to page in a site. It is an ASP.NET page that has the file name extension .master.
Microsoft Office SharePoint Designer 2007
A product for creating and customizing Microsoft SharePoint Web sites and building workflow-enabled applications based on SharePoint technologies. Office SharePoint Designer 2007 provides professional tools for building interactive solutions on the SharePoint platform without writing code, for designing custom SharePoint sites, and for maintaining site performance with reports and managed permissions.
Minimal Master Page
Also call a Base Master Page, a master page with only basic functionality, no formatting.
A decimal-numbered copy (0.1, 0.2, 1.1, ...) of a file that is in a stage of revision or that has changed only slightly since the previous version. Typically, a minor version is not published for a wider group in your organization to view.)
Acronym: Microsoft Office SharePoint Server 2007
A sign or symbol that specifies the type of calculation to perform within an expression. There are mathematical, comparison, logical, and reference operators.
A set of permissions that can be granted to users or SharePoint groups on an entity such as a site, library, list, folder, item, or document.
Authorization to perform specific actions such as viewing pages, opening items, and creating subsites.
A group of users that can be created on a SharePoint site to manage permissions to the site and to provide an e-mail distribution list for site users. A group can be used by many sites in one site collection.
A set of Web sites that have the same owner and share administration settings. Each site collection contains a top-level site, can contain one or more subsites, and may have a shared navigation structure.
A reusable column definition, or template, that you can assign to multiple lists across multiple SharePoint sites. Site columns are useful if your organization wants to establish some consistent settings across lists and libraries. You can share the settings across multiple lists and libraries so that you don't have to recreate the settings each time. For example, suppose you define a site column named Customer. Users can add that column to their content types or list. This ensures that the column has the same attributes, at least to start with, wherever it appears.
A set of colors, fonts, and decorative elements that provide a consistent appearance to your site.
A Web site at the top of the hierarchy in a site collection, from which you can manage site collection features. A top-level site can have multiple subsites.
Versioning enables you to store, track, and restore items in a list and files in a library as they are changed.
A set of columns on a Web page that displays items in a list or document library. The view can display sorted or filtered items, a selection of columns, or a custom layout.
A modular unit of information that consists of a title bar, a frame, and content. Web Parts are the basic building blocks of a Web Part Page.
Web Part Page
A special type of Web page that contains one or more Web Parts. A Web Part Page consolidates data, such as lists and charts, and Web content, such as text and images, into a dynamic information portal built around a common task.
Workflow is sometimes described as a series of tasks that produce an outcome. In the context of Microsoft SharePoint Products and Technologies, workflow is defined more narrowly as the automated movement of documents or items through a specific sequence of actions or tasks that are related to a business process.
Basic building blocks of a workflow include: events, actions, conditions, and steps. These building blocks allows you to quickly add application logic to your SharePoint applications. You create rules that associate conditions and actions with items in SharePoint lists and libraries. Changes to items in lists or libraries then trigger actions in the workflow.
Windows SharePoint Services 3.0
Extensible Markup Language
Extensible Stylesheet Language Transformations
Affiliate- An entity with a relationship with a peer or larger entity.
Affiliates can also be referred as publishers. In the Internet business,
affiliates promote someone else’s products in hopes of earning commissions
if the product is sold.
Affiliate Marketing- This is a web-based marketing practice in which a business rewards the affiliate for each visitor or customer brought about by the affiliate’s marketing efforts. Affiliates often use their own sites to drive traffic to another site.
Blog - The term blog is derived from the word weblog. It was originally used for personal web space as a means to communicate musings and other thoughts on a given subject. It has now been accepted as a professional corporate communication device between a company and its public. Blogs are usually very search engine friendly.
ClickBank - They are one of the largest online digital marketplaces that supply affiliates with products to sell. Joining as an affiliate is free and you can begin selling affiliate products immediately.
Click-Through - This is also known as a click. The consumer clicks on an advertiser’s link and is taken to another Web site. Click-through ratio is the percentage of click for the number of advertising impressions displayed.
Conversion rate - The percentage of visitors who take a desired action for an ad. The action that determines conversion is determined by the advertiser and may be a sale on the advertiser’s Web site, a completed lead form, or another action.
e-Tailer- Is a retailer that primarily uses the Internet as a medium for customers to shop for the goods or services provided. A good example of an e-Tailer is Amazon.
Earning Per Click (EPC) - This is the affiliate’s average earnings per 100 clicks. This metric is used by sponsors to measure the effectiveness of individual ads. EPC is a relative rating that illustrates the ability to convert clicks into commissions.
Impression- The viewing of an advertising banner, link, or product.
Internet Marketing - Also known as Online Marketing, is essentially the marketing of goods or services over the Internet. Internet Marketing help grow and promote a business through online media. Internet marketing strategy includes all aspects of online advertising products, services, and websites, including market research, email marketing, and direct sales.
Internet Service Provider (ISP) - A business or organization that provides consumers or businesses access to the Internet and related services. Various technologies such as dial-up, DSL and broadband (cable) are used to deliver the information to the consumer.
Keywords - These are words of significant value which are used as an index to find other information. Keywords are commonly used to target selected web traffic to your site.
Metrics - An in-dept data gathering about visitor behavior on your website. It tells you information such as who is coming to your site, where they are coming from, what web pages they are viewing, how long they stay on your site, etc.
Search Engine Marketing (SEM) - The act of marketing a Web site via search engines. This can be done by “natural” or “organic” search which refers to all unpaid search results based on relevance in the content site. Another way is by “paid” search, which refers to all pay-per-click advertising (such as on Google, Yahoo! or MSN).
Search Engine Optimization (SEO) - Is the process of improving the volume and quality of traffic to a web site from search engines via “natural” (”organic” or “algorithmic”) search results for targeted keywords. In general, it is considered good to have your listings occur on the first page of a Yahoo, Google, MSN or other web search engine. SEO helps to “rank” you higher in these listings.
Web Analytics - The process of analyzing the behavior metrics gathered from the visitors to your site. Businesses use these statistics to help determine how well they are satisfying their customers.
Web Crawling- A web crawler (also known as a web spider or web robot) is a program or automated script which browses the World Wide Web in a methodical, automated manner. This process is called web crawling or spidering.
Web Hosting Service - A company that provides space on a server they own for use by their clients. This allows individuals and organizations to provide their own websites for viewing on the world wide web. Two companies that provide great hosting packages at reasonable prices are HostGator.com and 1and1.com.
Just like the address on your house, it is a means to uniquely identify the location of a web page.
(Asymmetric Digital Subscriber Loop/Line). Commonly called "DSL" or "highspeed internet connection" or simply "broadband"; it uses the telephone line to provide access to the internet. Speeds used to be 10 times that of a dial up connection, but the technology is continually evolving and growing faster. Note: upload and download speeds are usually different. Important: Unlike the dial up, you can still use your telephone for voice or fax at the same time.
- Anonymous FTP (Anon FTP)
A method for downloading and uploading files using FTP protocol without having a username or a password. In place of a username, word "anonymous" is used, and in place of a password, email address is usually used. If a hosting plan offers this service, your users will be able to download or upload files with FTP without having their own account.
- Anonymous remailer
A SMTP server that allows sending anonymous email messages. It removes or changes the "From" field of all messages that it processes.
Apache is an open-source (source code is freely available and can be shared) HTTP Web server software. According to Netcraft survey, it is currently the most popular web server on the Net. It is usually run on Unix operating system versions like Linux or BSD, but it can also be run on Windows. It is a full-featured server with many powerful add-ons freely available. Apache's major competitor is Microsoft's IIS.
Most often refers to a small Java program designed to run in a Web browser. Java applets run in a sandbox, so they can't perform unauthorized functions like file reading or opening Net connections to other computer from your computer.
Archives are large files containing valuable data. Archives are often compressed to save space.
One of the most popular compression formats.
(American Standard Code for Information Interchange). A standard for coding text files. Every character has an associated number and any text can be represented by a sequence of numbers.
- ASP - Active Server Pages
Active Server Pages. ASP is Microsoft's server-side scripting technology. An Active Server Page has an .asp extension and it mixes HTML and scripting code that can be written in VBScript or JScript. ASP is distributed with Microsoft's IIS web server, so most host using IIS will also offer ASP for dynamic web programming. ASP.NET is the next version of ASP. Other popular server-side scripting languages are Perl, PHP, ColdFusion, TCL, Python, and JSP.
- Asynchronous Transfer Mode (ATM)
A set of network protocols designed for multimedia transmission. Data is partitioned into cells (53 bytes each) and passed along a virtual circuit. ATM allows for building very high speed networks.
A part of an email message. Usually a file (a data file or a multimedia file) or a webpage. It is not a part of the text of the message, instead it is attached to the message.
Audio file format for Unix systems.
Authentication is used to confirm the identity of the other party involved in the data transmission.
Audio/Video Interleave. Audio file format used by Microsoft Widows.
Main high-speed network connection composing the Internet. Backbones are operated by major telecommunications companies like Sprint, MCI, or AT&T. Internet backbone maps are here.
Bandwidth is the amount of data that can be transferred over the network in a fixed amount of time. On the Net, it is usually expressed in bits per second (bps) or in higher units like Mbps (millions of bits per second). 28.8 modem can deliver 28,800 bps, a T1 line is about 1.5 Mbps.
The part of an email message that contains the actual text of the message.
A way of storing a frequently visited website address. It is then easier to access the website in the future.
An automated piece of software that can be used in chat rooms or to crawl the web.
Computer program that allows to search the World Wide Web and displays the content of the web pages. Examples are Mosaic, Netscape, Mozilla, Opera and Internet Explorer.
- Browser sniffing
The process in which the web site tries to determine what kind of web browser the user is using. This is done to suit the website to the particular capabilities of the browser.
Popular programming languages (C++ includes objects) that can be used to create server programs that run after compilation. C and C++ were not designed specifically for web programming, but they can still be useful, especially because mature compilers producing very fast code and large code libraries already exist.
- CA - Certifying Authority. Authority which signs public key (PK), so some third party can verify, that the PK is really belongs to whom it claims to belong.
- Cable Modem
A cable modem is used for connecting to the Internet using the cable TV infrastructure. It offers high speed Internet access.
- Cascading Style Sheets (CSS)
A style-sheet determines how the HTML document is displayed by the browser. The current version of CSS is version 2 (CSS2).
(Conseil Europeen pour la Recherche Nucleaire) A nuclear research laboratory where the World Wide Web was invented.
Digital ID used for SSL transactions. It includes owner's public key, the name of the owner, the issuer, hostname, and the expiration date.
- Certificate Authority
A company trusted by a browser maker that issues digital certificates that are supposed to guarantee that the company is what it claims to be for use in encrypted digital transactions though SSL. Verisign is the biggest certificate authority.
Common Gateway Interface. A standard for interfacing web servers with an executable application. A CGI program can be written in any language like Perl or C/C++ and it is often stored in a special directory like /cgi-bin. CGI is often used to process data from HTML forms.
A directory on the server where the executable CGI scripts reside.
A network architecture where a system is divided into two parts: the client and the server.
Connecting many computers and making them appear as one machine. This is done to increase reliability and performance.
- Co-location (colo)
Putting a web server in a dedicated facility that provides high-speed Internet connection, security, environment, backup power, and technical support. Unlike the dedicated server, the client controls both hardware and software.
ColdFusion is an easy to use server-side scripting language developed by Allaire. It comes with ColdFusion Studio, a visual IDE. Here is Allaire's ColdFusion page. Other popular server-side scripting languages are ASP. Perl, PHP, TCL, Python, and JSP.
- Command-line interface
The opposite of the GUI (Graphical User Interface). A way of interacting with a computer system using the keyboard and a text-only display. Usually more powerful, but less user-friendly thatn a GUI.
- Control Panel
Control panel included in web hosting packages is an online web-based application that allows you to easily manage different aspects of your account. Most control panels will let you upload files, add email accounts, change contact information, set up shopping carts or databases, view usage statistics, etc.
A Cookie is a piece of data that is saved in the user's browser by the web server. It is used to customize user's browsing experience.
Central Processing Unit. The most important part of the computer.
Also known as spider, an automated software that retrieves webpages and follows the hyperlinks contained in them. Used to generate indexes used by search engines.
In Web hosting, the total size of files transferred by an account in a month. Sites with a lots of graphics, downloads, or streaming audio or video and a lot of visitors will require plans with more available transfer.
Data in a structured format stored on a web server. Most popular type is a relational database. The most common query (information retrieval) language for relational databases is SQL. Linux-based hosts most commonly include MySQL database and Windows NT-based hosts usually include Access or MS SQL databases.
Similar to co-location, except that you lease or rent hardware from a Web host. The main advantage over co-location is easier upgrade and usually better support. Getting a dedicated server or co-locating is necessary for sites that outgrow shared servers because they use a lot of bandwidth and resources or they require total control over software environment.
Data Encryption Standard - an U.S. government approved cipher. It is easy to break in its simplest form, but used multiple times with key of at least 128 bits provides good security.
(Dynamic Host Configuration Protocol). An automated way of obtaining an IP address in the Local Area Network.
Dialup access is a way of connecting a computer to the Internet using a modem and the telephone line. It is rather slow and blocks the telephone line.
Domain Name System. Internet service that maps Internet domains into corresponding IP addresses. DNS database is distributed and replicated among many DNS servers, so when you change your domain's IP address, the changes take a while to propagate.
Domain name is an easy-to-remember address that can be translated by DNS into server's IP address. Domain names are hierarchical. Domain's suffix indicates which TLD (top level domain) it belongs to, for example .com, .gov, .org, .net, or .jp. Recently ICANN (Internet Corporation for Assigned Names and Numbers) added several new TLDs, like .biz, .pro., and .museum.
Digital Subscriber Line. A better way of connecting a computer to the Internet using the telephone line. It's faster than the dialup and doesn't block the telephone line. However, it is more expensive because the special equipment is required.
Using web and Internet technologies in conducting the business activities. Also expanding end enhancing traditional business practices by means of the Internet.
EMail (Electronic Mail,e-mail)
One of the most popular Internet services. Basically it's the transmission of text based messages. An email message can also contain more structured elements, like tables, images and multimedia. It can also be used to send various data files, by means of attachments. You have to have an email account in order to be able to use this service.
Encryption means encoding data using a cryptographic cipher. Encrypted data can be read (decrypted) only by an authorized entity.
Local Area Network (LAN) protocol invented by Xerox Corporation. It is a broadcast protocol that uses CSMA/CD method and utilizes electrical cables. It can run at various speeds: 10Mbps, 100Mbps and even 1000Mbps. IEEE 802.3 standard describes Ethernet. Word Ethernet is also sometimes used to describe the implementation that runs at the speed of 10Mbps.
A part of the company's network that is made accessible for some group of people. Sometimes protected by a password or some other kind of authentication. It allows users to access some of the non-public data, eg. a person's credit card balance.
(Frequently Asked Question) Lists of frequently asked questions and answers to them are used as a way of sharing knowledge on the web. They are a very good way of finding solutions to different problems. Some companies include them in their websites to minimize the number of Customer Support inquiries.
Fast Ethernet is the implementation of Ethernet standard that operates at the speed of 100Mbps.
(Federal Communications Commission). U.S.A. telecommunications regulatory organization. It controls standards that pertain to electronic and electromagnetic transmission and also licenses the frequencies and bandwidth for the commercial use.
(Fiber Distributed Data Interface). A very high speed network protocol. Uses fiber-optic cable, and is mainly used as the backbone network protocol due to its speed. It is also often the choice for critical applications due to its reliability.
Fibre Optic Cable
A cable used for transmitting data as a light wave. A fiber optic cable is composed of one or more optical fibers. It is more expensive that copper wire, but offers higher transmission speeds and allows for communication over larger distances.
Last three or four letters of a file name that appear after the dot. Used to designate the type of file and the format used.
Screening network packets for certain properties, such as the source or destination address, protocol used or even a pattern in the data. It is used in firewalls in order to decide if the traffic is to be forwarded or rejected. Provides the basis for network security.
Firewall refers to either software-only or separate software and hardware combination that serves to protect an internal network or a computer from attacks and unauthorized access by sitting between the Internet and the internal network.
An insulting email message sent to an individual as punishment for not adhering to the netiquette. Can be sometimes seen in the newsgroups or on internet message boards.
Microsoft's server-side applications that lets users of FrontPage Web site creation tool to incorporate "web-bots" that perform pre-packaged function like full-text Web site searching or adding a hit counter. FrontPage extensions are also available for Unix-based operating systems but some hosts refuse to use them because of potential security holes.
File Transfer Protocol. The Internet protocol defining how to download and upload files between a client and an FTP server. Popular client FTP programs are CuteFTP and WS_FTP. Major browser also have FTP capability.
Using email messages to access the FTP sites. Requires a special software installed on the server.
(Graphics Interchange Format) A graphic file format invented by Compuserve. One of the most widely used formats for internet and web. Uses a lossless compression method, thus ensuring that the quality of the image is not lowered.
Gigabit Ethernet is the Ethernet standard implementation that runs at 1000Mbps.
Graphical User Interface (GUI)
A way of interacting with the computer that relies on graphical symbols. Most often requires a mouse. It is less powerful then the command-line interface, but is more user friendly and is easier to learn for users without technical background.
In the WWW world "hit" is used to describe a single request made by a web browser. The data transmitted by the web server in response to the request is a text file or a binary file (images, audio, video, executables and other data).
Main web page owned by a company, organization or an individual. This is the page that is initially displayed when user makes a request for a particular domain name.
A networked computer dedicated to providing a certain kind of service. Usually refers to a computer that stores the website files and has a web server running on it.
(Hypertext Markup Language). It is the language in which web pages are written. It allows the images to be combined with text and offers wide range of formatting capabilities. One of the most important features of HTML is hypertext, that allows web pages to be liked one to each other. HTML relies on tags, which have the following form:
- an opening tag,
- a closing tag. HTML code is stored in a normal text file.
Hyper Text Transfer Protocol. The main protocol used to transfer and receive data over the World Wide Web. The latest version of HTTP is 1.1. Basic HTTP transaction involves a WWW browser connecting to a server, browser sending a request to the server specifying its capabilities and which document is requested, server responding with the required data, and closing of the connection. The overview of HTTP is here.
A part of the web page that links to another web page. By clicking on a hyperlink user redirects the browser to another page. The word hyperlink is sometimes shortened to just "link".
A text on the web page that is linked to another webpage. Browsers usually display hypertext as underlined and in blue color.
Microsoft Internet Information Server. Microsoft's Web server that comes built-in with Windows NT Server 4 and Windows 2000 server. Here is Microsoft's IIS web site.
An image displayed on the webpage that has different areas that are hyperlinks. By clicking on different parts of the image browser can be redirected to another webpage, or can display modified version of the current one.
Internet Message Access Protocol. A method allowing a client email program to access remote messages stored on a mail server. The protocol includes operations for creating, deleting, and renaming mailboxes, checking for new messages, message parsing, searching, and setting and clearing flags. IMAP was originally developed in 1986 at Stanford.
Not to be confused with internet (with lowercase i). The word Internet refers to all the computer networks worldwide that are connected together. TCP/IP is the de facto standard protocol set for Internet.
An extremely fast network that connects major cities. Most often it utilizes T3 circuits and provides the bandwidth of 45Mbps.
The organization that handles domain name registrations. See http://www.internic.net/
A part of an organization's network that is private. Only authorized individuals have access to the intranet. Besides that an intranet is very similar to the Internet in a sense that it offers the same services and uses the same protocols.
(Internet Protocol) is tha main protocol used on the Internet.
Internet Protocol Address. A unique number that identifies all devices connected to the Internet. It usually looks like groups of numbers from 0 to 255, separated by periods, such as 255.62.130.128.
IP packet is the basic data chunk that can be sent over the Internet. All the data is partitioned into IP packets on the sending computer and reassembled on the receiving computer.
Internet Relay Chat. Multi-user chat service. IRC users can go into public or private channels to discuss a topic or transfer files. IRC servers are connected into networks. The most popular IRC client program is mIRC. Many hosts are vary of letting customers access IRC because of a possibility of a denial of service attack on the whole network.
(Integrated Services Digital Network). An international standard that governs the transmission of both voice and data. It uses a digital circuits and has speed of 64Kbps. It can be used for normal telephone service as well as data transmission.
(International Organisation for Standardisation). An Geneva-based international organization that develops and publishes various international standards.
Internet Service Provider. A company that provides its subscribers with Internet access. Customers have a username and a password and can dial-up or use a cable or DSL line to connect to ISP's network which is connected to the Internet. The biggest ISP is AOL.
A popular compression format. Also a name of a compression utility.
Sun's popular programming language. Java is a platform-independent (at least in theory), crash-protected, object-oriented language that can be used to write applets that run in a browser, servlets that run server-side, or independent programs. Java's syntax is similar to that of C++.
Servlets are programs written in Java that run on a Web server and can produce dynamic pages. Also see JSP.
Java Virtual Machine (JVM, Java Runtime Environment)
A set of programs that allow for Java applets to be run on a particular computer system.
Java Database Connectivity - a mechanism allowing Java applets to access different databases.
(Joint Photographic Experts Group) A image compression format designed for the Internet. Uses lossy compression, meaning that the quality of the image can be lowered.
Java Server Pages. Extension of Java Servlet technology for combining Java server-side programs and HTML. JSP pages have an extension .jsp.
Kilobits per second. 1Kbps = 1024bps.
Protocol for transferring files over the modem connection. It's very slow.
Local Area Network. A network of devices (computers, printers, hubs) occupying a small area. Usually LANs do not span more than one building. LANs are very fast compared to WANs.
A dedicated communication line. User is charged a flat fee instead of being billed per minute of usage.
Another name for a connection. Sometimes refers to a physical line.
A free UNIX-like operating system developed by Linus Torvalds. Linux and FreeBSD are very often used by hosting companies as their operating systems.
An alias for an individual that is used for identification and authentication when accessing a computer system. Usually it is a sequence of characters and digits.
A compression scheme is loseless when decompressed file is exactly the same as the original. This is needed for compressing executable programs and data files.
A lossy compression allows for the quality of the compressed data to be diminished after decompression. It is suitable for audio, video and image compression.
Lempel Ziv Welch compression - a popular compression algorithm.
A way of having a group discussion with list subscribers by email. Emails are sent to all list subscribers. Popular mailing list programs, like Listserv and Majordomo, allow for automated subscription and un-subscription from a mailing list. Some hosting plans allow creation of mailing lists.
The Internet host (together with the appropriate software) that is used to send, receive and forward email messages.
1MB = 1024 KiloBytes = 1,048,576 bytes
Microsoft's low-end relational database included with the MS Office suite. Here is the official site for Microsoft Access. Cheaper NT hosting plans sometimes include MS Access database.
Microsoft's RAD tool for creating relational databases. FoxPro is a part of Microsoft's Visual Studio. Here is the official MS FoxPro site.
Microsoft SQL Server
Microsoft's high-end SQL database running on Windows systems. Here is the official Microsoft SQL Server site.
(Multipurpose Internet Mail Extensions) a method of including binary data and other multimedia content within email messages.
An FTP site that stores the exact content of some other site. Mirroring is done in order to minimize the load on a particular server and also to increase reliability.
Miva's XML based server-side scripting language. Available for multiple platforms. Here is Miva Empressa home page.
Browser-based storefront development and management system for merchants. Here is Miva Merchant home page.
MOdulator-DEModulator. A device used to transform digital data sent by a computer to analog format suitable for transmission over a telephone line. It also transforms analog signals back to the digital form. A modem is required for the dial up connection to the Internet.
An extremely popular lossy audio compression format. Widely used over the Internet.
(Motion Picture Experts Group) video compression format for movies or animations.
mSQL (Mini SQL)
Light-weight relational database. Here is mSQL home page.
Most popular open-source relational database. Many Unix-based plans allow MySQL databases. Here is MySQL home page.
Informal set of rules that should be followed when using internet services like email, message boards and newsgroups. Describes what it means to be "well behaved" while interacting with other people online.
A company that developed one of the most popular web browsers: Netscape Navigator.
A group of electronic devices connected together that are able to communicate with each other.
A virtual Internet place where people exchange thoughts, ideas and interests, amuse themselves and do a zillion other things, all by means of text messages.
An application that allows to use newsgroups.
(Open Database Connectivity) A standard allowing applications to access different databases in an uniform way.
The state of the computer when it is not connected to the network (i.e. it is not online).
Tha state of a computer when it is connected to the network and communicate with other machines.
A software heart of the computer. It is a set of programs that manage the hardware resources of a computer, provide the environment for application programs to run and provide the user interface. Most known operating systems are: different flavors of Unix (SunOs, HP-UX, Irix, FreeBSD, Linux,...), MacOS and Windows.
Name for a basic web document. Websites usually consist of many (web) pages.
(Password Authentication Protocol). PAP is the authentication protocol used over PPP connections.
A peer-to-peer network is a collection of computers that can communicate and share information, but that don't have any kind of hierarchical structure. This is the opposite of the client/server model.
Open source CGI scripting programming language. Written in 1987. Still one of the most popular web programming languages mostly due to its powerful text-manipulation facilities. A huge number of Perl scripts are available for download.
PHP is an free, open-source server-side scripting language. PHP code can be embedded in HTML. PHP files usually have extensions like .php or .php3. PHP language style is similar to C and Java. Here is the PHP group web site. Other popular server-side scripting languages are ASP, Perl, ColdFusion, TCL, Python, and JSP.
PKZIP or PKUNZIP
Popular compression and decompression programs.
An add-on piece of software that can extend the features of an existing application. For example Netscape browser plug-ins allow displaying of new types of web content, that the browser can't display on its own.
Short for Post Office Protocol, a protocol used to retrieve e-mail from a mail server. Most e-mail applications (sometimes called an e-mail client) use the POP protocol, although some can use the newer IMAP (Internet Message Access Protocol).
There are two versions of POP. The first, called POP2, became a standard in the mid-80's and requires SMTP to send messages. The newer version, POP3, can be used with or without SMTP.
Post Office Protocol. Popular but inflexible email retrieval standard. All messages are downloaded at the name time and can only be manipulated on a client machine. Current version is POP3. Also see IMAP.
A socket on the computer or other network device used to connect it to the network.
(Point to Point Protocol) A network protocol widely used to connect computers to the Internet. Most often used on a telephone line.
Interpreted programming language, sometimes offered by hosts for server-side scripting. Here is the Python home page.
Redundant Array of Independent (or Inexpensive) Disks. Type of disk, often used on servers, where several physical disks are combined into an array for better speed and fault tolerance.
Level 0 implements data striping where file blocks are written to separate drives. Does not provide fault tolerance, because failure of one drive will result in data loss.
Level 1 implements data mirroring. Data is duplicated on two drives either through software or hardware. Provides faster read performance than a single drive.
Level 2 - not used in practice. Data is split at bit level at written to multiple drives.
Level 3 - requires at least 3 drives. Data block is striped at byte level across drives and error correction codes (parity info) is recorder on another drive. Provides fault tolerance but slower writing performance.
Level 4 - Similar to Level 3 but provides faster performance because it uses blocks for striping.
Level 5 - Similar to Level 4 but improves performance but also striping parity info across multiple drives.
Level 6 - Similar to Level 5 but also uses second parity scheme for better fault tolerance.
Level 7 - Proprietary RAID design by Storage Computer Corporation. Faster than other levels because it uses multiple levels of cache and asynchronous I/O transfers.
In addition multiple RAID levels can be combined to improve performance or reliability.
Raw access data updated in real-time that can be downloaded and used by any statistics program. Typically each line show the user's IP, date and time of the access, what kind of request was done, which document was requested, HTTP status code, bytes transferred, referrer, and user agent info. If a host doesn't have statistics, you'll need access to raw logs to identify who your site's visitors are. Analyzing raw logs can also provide more detailed look at site accesses than stats.
Resellers are usually smaller companies that still try to build their customers base. They don't own the server with user accounts but can perform most administrative functions.
A public key cryptosystem developed by Ron Rivest, Adi Shamir, and Leonard Adleman. It can be used to encrypt session keys and to generate digital certificates.
Secure Multipurpose Internet Mail Extensions - a way of making email messages more secure. S/MIME uses digital certificates to attest the message origin and encryption to ensure that message could not be read while in transit.
Scalability The ability of a piece of software or hardware to expand/grow
as if necessary.
A programming language in which programs are the series of commands that are interpreted and then executed one by one. Doesn't require the compilation phase, for the price of lower performance.
An Internet service that stores a vast number of web pages and allows for fast searching among them. Also, a piece of software that implements a website search functionality.
An online form in which a query to the webpages database is specified.
A search engine uses search indexer to provide faster search.
An archived file that is also an executable program. The original archived file is decompressed when that program is run.
Serial Line Internet Protocol (SLIP)
Another network protocol used to connect computers to the networks. Similar to PPP.
A networked computer that handles client requests for Web pages.
A company that provides access to the Internet, usually for a fee.
All the data exchange between two parties, starting when the connection is established and ending when connection terminates.
Initial fee charged by a host to set up your hosting account. You can use our Price Search to minimize your costs over a period of time including the setup fee.
Shared hosting The most basic of Web hosting types. With shared hosting,
numerous Web sites are shared on one server. While an economic solution,
they typically cannot handle large amounts of storage or traffic.
Shopping cart A program designed to handle the e-commerce section of a Web site. Shopping cart software lets users browse for and purchase products online.
Software that allows users to select products from a Web catalog, modify their choices, calculate prices, review their choices, and order them. Many hosts with e-commerce plans offer installed shopping carts, but you can always get a shopping cart of your choice instead.
Secure HTTP. A version of HTTP protocol that uses encryption to assure that the traffic between the server and the browser cannot be eveasdropped on. Should be considered mandatory for all e-commerce applications.
A few lines of text that are automatically attached at the end of each email message by the email client. Usually it's some personal identification or an (un)interesting quote.
An applet that has a digital signature to confirm that it originates from the legitimate server.
Simple Mail Transfer Protocol. Very popular protocol used to transfer email messages across the Internet mail servers.
A normal paper mail delivered by the Post Office.
Unsolicited email sent in mass quantities to multiple receipents, most often for marketing purposes. Highly annoying and constituting one of the most serious netiquette violations.
An automated software that retrieves webpages and follows the hyperlinks contained in them. Used to generate indexes used by search engines.
Structured Query Language. Limited programming language used for updating and performing queries on relational databases. All databases share a common subset of SQL. Most popular SQL databases available with hosting plans are MySQL and MS SQL.
Secure Shell. Developed by SSH Communications Security, it is a standard for encrypted terminal Internet connections. SSH programs provide strong authentication and encrypted communications, replacing less secure access methods like telnet.
Server-Side Includes. Instructs the server to include some dynamic information in a Web page before it is sent to a client. This dynamic information could be current date, an opinion poll, etc. Many hosts require that SSI pages have .shtml extension to reduce the load on servers by not having to parse non-SSI pages.
Secure Sockets Layer. Protocol developed by Netscape to provide encryption for commercial transactions data that should be protected while traveling over the Internet, like credit card numbers. SSL uses https protocol. Before using SSL in commerce, you'll also need to get is a certificate from a Certificate Authority.
Static (or dedicated) IP
If a host offers a static IP, it means that your site will be assigned a unique and unchanging IP address. See the FAQ for some possible advantages of using a static IP.
Playing multimedia files (audio and video) without requiring a full download. Audio and video are compressed but they still may require a lot of bandwidth. Most popular streaming media formats are Real Audio/Video.
Using world wide web is often referred to as "surfing the web".
Dedicated telecommunications line providing 1.5 Mbps of bandwidth. Consists of 24 individual channels 64 Kbps each, that can be configured for voice or data transmission. T1 lines can be leased by businesses that required a dedicated Net connection of with higher reliability than a DSL and faster than an ISDN line but are still quite expensive.
Dedicated telecommunications line providing 44 Mbps of bandwidth. T3 lines are often used by ISPs to connect to the Internet backbone.
(Transmission Control Protocol) is the most important of the network protocols used in the Internet.
(Transmission Control Protocol/Internet Protocol). This protocol suite is the de facto standard for the today's Internet. TCP is a higher level protocol that runs on top of the IP protocol.
Character-based protocol for connecting with remote systems. Still popular among hosts, but it is being replaced by much more secure SSH access.
Top Level Domain. The domain name elements at the right, such as .com, .org or .gov. Recently, new TLDs like .pro or .museum were added to supplement older TLDs. ccTLDs (country code TLDs), like .uk or .fr are used per-country.
Data packets being transmitted over a network.
A family of multi-user operating systems, first developed by AT&T Bell Laboratories in the 1970s and then licensed to many universities. A basis for Linux, a very popular operating system among web hosts.
Uninterruptible Power Supply. UPS keeps the server running on a battery for several minutes after a power outage, allowing for a clean shutdown without loss of data. UPS can also shield the server from line voltage spikes and drops.
(Uniform Resource Locator) is a way of addressing used for world wide web. An URL consist of the type of service (protocol), then the host name and then the file on the host.
Network of all the newsgroups in the Internet.
It is a method of sending binary files using email - similar in purpose to MIME.
An stand-alone application used to display files of different formats. For example a QuickTime move viewer or a JPG file viewer.
A virus is a malicious program written to do as much harm as possible. Viruses can spread themselves over the network.
(Virtual Private Network). A virtual private network is a method of accessing the private network in a secure way over public communication lines and networks.
World Wide Web Consortium. An international industry consortium that develops standards for the world wide web.
An audio file format. Very accurate, but offers no compression, thus resulting in very large files.
A person responsible for the maintenance of a particular website.
An Internet service allowing to obtain the information about the domain name owner.
World Wide Web (or Web) is the most popular Internet service. It allows access to the information and services from the web servers. A web browser is needed to use the Web.
Extensible Markup Language. A meta-language, abbreviated version of SGML, used to specify other document types used on the Web. Accepted as a format in 1998 to replace dependence on HTML extensions. MSIE 5.5 and Netscape 6 both support XML.
A program that can XML documents and access their content.
A file transfer protocol. Rather slow.
A popular compression utility.
BACK / FORWARD
Buttons in most browsers' Tool Button Bar, upper left. BACK returns you to the document previously viewed. FORWARD goes to the next document, after you go BACK.
If it seems like the BACK button does not work, check if you are in a new Netscape window; some Web pages are programmed to open a new window when you click on some links. Each window has its own short-term search HISTORY. If this does not work, use GO to select the page you want (some Web pages are programmed to disable BACK).
BLOG or WEB LOG
A blog (short for "web log") is a type of web page that serves as a publicly accessible personal journal (or log) for an individual. Typically updated daily, blogs often reflect the personality of the author. Blog software usually has archives of old blogs, and is searchable. Frequently blogging software is used by web pages providing excellent information on many topics, although very frequently the content is personal and requires VERY careful evaluation.
Way in Netscape to store in your computer direct links to sites you wish to return to. The equivalent in Internet Explorer (IE) is called a "Favorite." To create a bookmark in Netscape, click on BOOKMARKS, then ADD BOOKMARK. Or left-click on and drag the little bookmark icon (in Netscape 4.6 and higher, to the right of the word BOOKMARK) to the place you want a new bookmark filed. To visit a bookmarked site, click on BOOKMARKS and select the site from the list.
The equivalent in Internet Explorer to Netscape's Bookmarks is called "Favorites."
You can download a bookmark file to diskette and install it on another computer. To do this in Netscape, select BOOKMARKS, then EDIT BOOKMARKS, then, in the FILE menu, select SAVE AS. To do this in IE, select from the main browser tool bar FILE, then Import and Export... and follow directions for exporting to a file. Import (part of the same IE program) allows you to bring a Netscape Bookmark file into IE as Favorites.
Way to combine terms using "operators" such as "AND," "OR," "AND NOT" and sometimes "NEAR." AND requires all terms appear in a record. OR retrieves records with either term. AND NOT excludes terms. Parentheses may be used to sequence operations and group words. Always enclose terms joined by OR with parentheses. Which search engines have this?
See +REQUIRE or -REJECT TERM and FUZZY AND. Want a more extensive explanation of Boolean logic, with illustrations?
To follow links in a page, to shop around in a page, exploring what's there, a bit like window shopping. The opposite of browsing a page is searching it. When you search a page, you find a search box, enter terms, and find all occurrences of the terms throughout the site. When you browse, you have to guess which words on the page pertain to your interests. Searching is usually more efficient, but sometimes you find things by browsing that you might not find because you might not think of the "right" term to search by.
Browsers are software programs that enable you to view WWW documents. They "translate" HTML-encoded files into the text, images, sounds, and other features you see. Microsoft Internet Explorer (called simply IE), Netscape, Mosaic, Macweb, and Netcruiser are examples of browsers that enable you to view text and images and many other WWW features. They are software that must be installed on your computer. For more information about browsers, consult the introductory pages of the Teaching Library tutorial. See also LYNX, a browser often used from slow modems because it does not display images, colors, or sound, but lets you perform most basic WWW functions and see the content.
In browsers, "cache" is used to identify a space where web pages you have visited are stored in your computer. A copy of documents you retrieve is stored in cache. When you use GO, BACK, or any other means to revisit a document, the browser first checks to see if it is in cache and will retrieve it from there because it is much faster than retrieving it from the server.
In search results from Google, Yahoo! Search, and some other search engines, there is usually a Cached link which allows you to view the version of a page that the search engine has stored in its database. The live page on the web might differ from this cached copy, because the cached copy dates from whenever the search engine's spider last visited the page and detected modified content. Use the cached link to see when a page was last crawled and, in Google, where your terms are and why you got a page when all of your search terms are not in it.
Capital letters (upper case) retrieve only upper case. Most search tools are not case sensitive or only respond to initial capitals, as in proper names. It is always safe to key all lower case (no capitals), because lower case will always retrieve upper case. Which search engines have this?
A message from a WEB SERVER computer, sent to and stored by your browser on your computer. When your computer consults the originating server computer, the cookie is sent back to the server, allowing it to respond to you according to the cookie's contents. The main use for cookies is to provide customized Web pages according to a profile of your interests. When you log onto a "customize" type of invitation on a Web page and fill in your name and other information, this may result in a cookie on your computer which that Web page will access to appear to "know" you and provide what you want. If you fill out these forms, you may also receive e-mail and other solicitation independent of cookies.
DOMAIN, TOP LEVEL DOMAIN (TLD)
Hierarchical scheme for indicating logical and sometimes geographical venue of a web-page from the network. In the US, common domains are .edu (education), .gov (government agency), .net (network related), .com (commercial), .org (nonprofit and research organizations). Outside the US, domains indicate country: ca (Canada), uk (United Kingdom), au (Australia), jp (Japan), fr (France), etc. Neither of these lists is exhaustive. See also DNS entry.
DOMAIN NAME, DOMAIN NAME SERVER (DNS)ENTRY
Any of these terms refers to the initial part of a URL, down to the first /, where the domain and name of the host or SERVER computer are listed (most often in reversed order, name first, then domain). The domain name gives you who "published" a page, made it public by putting it on the Web.
A domain name is translated in huge tables standardized across the Internet into a numeric IP address unique the host computer sought. These tables are maintained on computers called "Domain Name Servers." Whenever you ask the browser to find a URL, the browser must consult the table on the domain name server that particular computer is networked to consult.
"Domain Name Server entry" frequently appears a browser error message when you try to enter a URL. If this lookup fails for any reason, the "lacks DNS entry" error occurs. The most common remedy is simply to try the URL again, when the domain name server is less busy, and it will find the entry (the corresponding numeric IP address). For more information, see "All About Domain Names."
To copy something from a primary source to a more peripheral one, as in saving something found on the Web (currently located on its server) to diskette or to a file on your local hard drive. More information.
EXTENSION or FILE EXTENSION
In Windows, DOS and some other operating systems, one or several letters at the end of a filename. Filename extensions usually follow a period (dot) and indicate the type of file. For example, this.txt denotes a plain text file, that.htm or that.html denotes an HTML file. Some common image extensions are picture.jpg or picture.jpeg or picture.bmp or picture.gif
In the Internet Explorer browser, a means to get back to a URL you like, similar to Netscape's Bookmarks.
Ability to limit a search by requiring word or phrase to appear in a specific field of documents (e.g., title, url, link). See LIMITING TO FIELD.
Button in Netscape Tool Button Bar at top. Searches for word(s) keyed in document in screen only. Useful to locate a term in a long document. Can be invoked by the keyboard command, Ctrl+F.
How up-to-date a search engine database is, based primarily on how often its spiders recirculate around the Web and update their copies of the web pages they hold, and discover new ones. Also determined by how quickly they integrate new sites that web authors send to them. Two weeks is about as good as most search engines do, but some update certain selected web sites more frequently.
A format for web documents that divides the screen into segments, each with a scroll bar as if it were as "window" within the window. Usually, selecting a category of documents in one frame shows the contents of the category in another frame. To go BACK in a frame, position the cursor in the frame an press the right mouse button, and select "Back in frame" (or Forward).
You can adjust frame dimensions by positioning the cursor over the border between frames and dragging the border up/down or right/left holding the mouse button down over the border.
File Transfer Protocol. Ability to transfer rapidly entire files from one computer to another, intact for viewing or other purposes.
In ranking of results, documents with all terms (Boolean AND) are ranked first, followed by documents containing any terms (Boolean OR) are retrieved. The farther down, the fewer the terms, although at least one should always be present.
Button in Netscape Menu Bar at top. Provides list of recent sites you visited, retained for the current session only. Click on any site in the list to return to the site. For a more permanent marker, make a BOOKMARK.
HEAD or HEADER (of HTML document)
The top portion of the HTML source code behind Web pages, beginning with <HEAD> and ending with </HEAD>. It contains the Title, Description, Keywords fields and others that web page authors may use to describe the page. The title appears in the title bar of most browsers, but the other fields cannot be seen as part of the body of the page. To view the <HEAD> portion of web pages in Netscape, click VIEW, Page Source. In Internet Explorer, click VIEW, Source. Some search engines will retrieve based on text in these fields.
HISTORY, Search History
Available by using the combined keystrokes CTRL + H, a more permanent record of sites you have visited/retrieved than GO. You can set how many days your Netscape retains history in Edit | Preferences, and in Internet Explorer in Tools | Internet Options ? General.
Computer that provides web-documents to clients or users. See also server.
Hypertext Markup Language. A standardized language of computer code, imbedded in "source" documents behind all Web documents, containing the textual content, images, links to other documents (and possibly other applications such as sound or motion), and formatting instructions for display on the screen. When you view a Web page, you are looking at the product of this code working behind the scenes in conjunction with your browser. Browsers are programmed to interpret HTML for display.
You can see HTML in Netscape by selecting the View pop-down menu tab, then "Document Source." If you download a document as "Source," the file will contain HTML markup codes and can be viewed in Netscape and other browsers.
On the World Wide Web, the feature, built into HTML, that allows a text area, image, or other object to become a "link" (as if in a chain) that retrieves another computer file (another Web page, image, sound file, or other document) on the Internet. The range of possibilities is limited by the ability of the computer retrieving the outside file to view, play, or otherwise open the incoming file. It needs to have software that can interact with the imported file. Many software capabilities of this type are built into browsers or can be added as "plug-ins."
INTERNET (Upper case I)
The vast collection of interconnected networks that all use the TCP/IP protocols and that evolved from the ARPANET of the late 60’s and early 70’s. An "internet" (lower case i) is any computers connected to each other (a network), and are not part of the Internet unless the use TCP/IP protocols. An "intranet" is a private network inside a company or organization that uses the same kinds of software that you would find on the public Internet, but that is only for internal use. An intranet may be on the Internet or may simply be a network.
IP Address or IP Number
(Internet Protocol number or address). A unique number consisting of 4 parts separated by dots, e.g. 220.127.116.11
Every machine that is on the Internet has a unique IP address. If a machine does not have an IP number, it is not really on the Internet. Most machines also have one or more Domain Names that are easier for people to remember.
ISP or Internet Service Provider
A company that sells Internet connections via modem (examples: aol, Mindspring - thousands of ISPs to choose from; not easy to evaluate). Faster, more expensive Internet connectivity is available via cable, DSL, ISDN, or web-TV. Often these companies also provide Web page hosting service (free or relatively inexpensive web pages -- the origin of many personal pages).
A network-oriented programming language invented by Sun Microsystems that is specifically designed for writing programs that can be safely downloaded to your computer through the Internet and immediately run without fear of viruses or other harm to our computer or files. Using small Java programs (called "Applets"), Web pages can include functions such as animations, calculators, and other fancy tricks. We can expect to see a huge variety of features added to the Web using Java, since you can write a Java program to do almost anything a regular computer program can do, and then include that Java program in a Web page. For more information search any of these jargon terms in the PC Webopedia.
A simple programming language developed by Netscape to enable greater interactivity in Web pages. It shares some characteristics with JAVA but is independent. It interacts with HTML, enabling dynamic content and motion.
A word searched for in a search command. Keywords are searched in any order. Use spaces to separate keywords in simple keyword searching. To search keywords exactly as keyed (in the same order), see PHRASE.
LIMITING TO A FIELD
Requiring that a keyword or phrase appear in a specific field of documents retrieved. Most often used to limit to the "Title" field in order to find documents primarily about one or more keywords. (Can be used for other fields. See the table summarizing search tools features.)
The URL imbedded in another document, so that if you click on the highlighted text or button referring to the link, you retrieve the outside URL. If you search the field "link:", you retrieve on text in these imbedded URLs which you do not see in the documents.
Term used to describe the frustrating and frequent problem caused by the constant changing in URLs. A Web page or search tool offers a link and when you click on it, you get an error message (e.g., "not available") or a page saying the site has moved to a new URL. Search engine spiders cannot keep up with the changes. URLs change frequently because the documents are moved to new computers, the file structure on the computer is reorganized, or sites are discontinued. If there is no referring link to the new URL, there is little you can do but try to search for the same or an equivalent site from scratch.
A discussion group mechanism that permits you to subscribe and receive and participate in discussions via e-mail. For more information see the Beyond General Web Searching Listservers section or attend Part III of these Web courses.
Lynx is a "browser" program like Netscape or Internet Explorer that can access information on World Wide Web, but without access to images, film, or sound. It is used often from slow modems to eliminate the need to wait to download images and other features. Lynx allows you to read the text of any WWW document, and to select hypertext links in these documents. You can use Lynx to go to any WWW document, to fill out forms available on WWW, to print and save files and perform many other tasks. For information on how to use Lynx, see Lynx Basics.
Search engines that automatically submit your keyword search to several other search tools, and retrieve results from all their databases. Convenient time-savers for relatively simple keyword searches (one or two keywords or phrases in " "). See Meta-Search Engines page for complete descriptions and examples.
A term used in Boolean searching to indicate the sequence in which operations are to be performed. Enclosing words in parentheses identifies a group or "nest." Groups can be within other groups. The operations will be performed from the innermost nest to the outmost, and then from left to right.
A discussion group operated through the Internet. Not to be confused with LISTSERVERS which operate through e-mail. For more information see the Beyond General Web Searching Usenet Newsgroups section.
A web page created by an individual (as opposed to someone creating a page for an institution, business, organization, or other entity). Often personal pages contain valid and useful opinions, links to important resources, and significant facts. One of the greatest benefits of the Web is the freedom it as given almost anyone to put his or her ideas "out there." But frequently personal pages offer highly biased personal perspectives or ironical/satirical spoofs, which must be evaluated carefully. The presence in the page's URL of a personal name (such as "jbarker") and a ~ or % or the word "users" or "people" or "members" very frequently indicate a site offering personal pages.
PACKET, PACKET JAM
When you retrieve a document via the WWW, the document is sent in "packets" which fit in between other messages on the telecommunications lines, and then are reassembled when they arrive at your end. This occurs using TCP/IP protocol. The packets may be sent via different paths on the networks which carry the Internet. If any of these packets gets delayed, your document cannot be reassembled and displayed. This is called a "packet jam." You can often resolve packet jams by pressing STOP then RELOAD. RELOAD requests a fresh copy of the document, and it is likely to be sent without jamming.
PDF or .pdf or pdf file
Abbreviation for Portable Document Format, a file format developed by Adobe Systems, that is used to capture almost any kind of document with the formatting in the original. Viewing a PDF file requires Acrobat Reader, which is built into most browsers and can be downloaded free from Adobe.
More than one KEYWORD, searched exactly as keyed (all terms required to be in documents, in the order keyed). Enclosing keywords in quotations " " forms a phrase in AltaVista, , and some other search tools. Some times a phrase is called a "character string."
An application built into a browser or added to a browser to enable it to interact with a special file type (such as a movie, sound file, Word document, etc.)
POPULARITY RANKING of search results
Some search engines rank the order in which search results appear primarily by how many other sites link to each page (a kind of popularity vote based on the assumption that other pages would create a link to the "best" pages). Google is the best example of this. See also Subject-Based Ranking.
+REQUIRE or -REJECT A TERM OR PHRASE
Insert + immediately before a term (no space) to limit search to documents containing a term. Insert - immediately before a term (no space) to exclude documents containing a term. Can be used immediately (no space) before the " " delimiting a phrase.
Functions partially like basic BOOLEAN LOGIC. If + precedes more than one term, they are required as with Boolean AND. If - is used, terms are excluded as with Boolean AND NOT. If neither + no - is used, the default if Boolean OR. However, full Boolean logic allows parentheses to group and sequence logical operations, and +/- do not. Which search engines have this?
RELEVANCY RANKING of search results
The most common method for determining the order in which search results are displayed. Each search tool uses its own unique algorithm. Most use "fuzzy and" combined with factors such as how often your terms occur in documents, whether they occur together as a phrase, and whether they are in title or how near the top of the text. Popularity is another ranking system.
A script is a type of programming language that can be used to fetch and display Web pages. There are may kinds and uses of scripts on the Web. They can be used to create all or part of a page, and communicate with searchable databases. Forms (boxes) and many interactive links, which respond differently depending on what you enter, all require some kind of script language. When you find a question marke (?) in the URL of a page, some kind of script command was used in generating and/or delivering that page. Most search engine spiders are instructed not to crawl pages from scripts, although it is usually technically possible for them to do so (see Invisible Web for more information).
SCROLL (DOWN, UP, LEFT, RIGHT)
Moving up or down within a document in your screen. Use scroll bar at right. Click on arrow down or arrow up. Drag the scroll button down or up. Or click on the page up or page down icons at the bottom of the bar. If you need to scroll left or right, use the scroll bar at the bottom.
SERVER, WEB SERVER
A computer running that software, assigned an IP address, and connected to the Internet so that it can provide documents via the World Wide Web. Also called HOST computer. Web servers are the closest equivalent to what in the print world is called the "publisher" of a print document. An important difference is that most print publishers carefully edit the content and quality of their publications in an effort to market them and future publications. This convention is not required in the Web world, where anyone can be a publisher; careful evaluation of Web pages is therefore mandatory. Also called a "Host."
Something that operates on the "server" computer (providing the Web page), as opposed to the "client" computer (which is you or someone else viewing the Web page). Usually it is a program or command or procedure or other application causes dynamic pages or animation or other interaction.
SHTML, usually seen as .shtml
An file name extension that identifies web pages containing SSI commands.
SITE or WEB-SITE
This term is often used to mean "web page," but there is supposed to be a difference. A web page is a single entity, one URL, one file that you might find on the Web. A "site," properly speaking, is an location or gathering or center for a bunch of related pages linked to from that site. For example, the site for the present tutorial is the top-level page "Internet Resources." All of the pages associated with it branch out from there -- the web searching tutorial and all its pages, and more. Together they make up a "site." When we estimate there are 5 billion web pages on the Web, we do not mean "sites." There would be far fewer sites.
Computer robot programs, referred to sometimes as "crawlers" or "knowledge-bots" or "knowbots" that are used by search engines to roam the World Wide Web via the Internet, visit sites and databases, and keep the search engine database of web pages up to date. They obtain new pages, update known pages, and delete obsolete ones. Their findings are then integrated into the "home" database.
Most large search engines operate several robots all the time. Even so, the Web is so enormous that it can take six months for spiders to cover it, resulting in a certain degree of "out-of-datedness" (link rot) in all the search engines. For more information, read about search engines.
SPONSOR (of a Web page or site)
Many Web pages have organizations, businesses, institutions like universities or nonprofit foundations, or other interests which "sponsor" the page. Frequently you can find a link titled "Sponsors" or an "About us" link explaining who or what (if anyone) is sponsoring the page. Sometimes the advertisers on the page (banner ads, links, buttons to sites that sell or promote something) are "sponsors." WHY is this important? Sponsors and the funding they provide may, or may not, influence what can be said on the page or site -- can bias what you find, by excluding some opposing viewpoint or causing some other imbalanced information. The site is not bad because of sponsors, but you they should alert you to the need to evaluate a page or site very carefully.
SSI stands for "server-side include," a type of HTML instruction telling a computer that serves Web pages to dynamically generate data, usually by inserting certain variable contents into a fixed template or boilerplate Web page. Used especially in database searches.
In keyword searching, word endings are automatically removed (lines becomes line); searches are performed on the stem + common endings (line or lines retrieves line, lines, line's, lines', lining, lined). Not very common as a practice, and not always disclosed. Can usually be avoided by placing a term in " ".
In database searching, "stop words" are small and frequently occurring words like and, or, in, of that are often ignored when keyed as search terms. Sometimes putting them in quotes " " will allow you to search them. Sometimes + immediately before them makes them searchable. See Table of Search Engine features.
SUBJECT-BASED POPULARITY RANKING of search results
A variation on popularity ranking in which the links in pages on the same subject are used to in ranking search results. Used by Teoma.
An approach to Web documents by a lexicon of subject terms hierarchically grouped. May be browsed or searched by keywords. Subject directories are smaller than other searchable databases, because of the human involvement required to classify documents by subject.
Ability to search only within the results of a previous search. Enables you to refine search results, in effect making the computer "read" the search results for you selecting documents with terms you sub-search on. Can function much like RESULTS RANKING. Which search engines have this?
(Transmission Control Protocol/Internet Protocol) -- This is the suite of protocols that defines the Internet. Originally designed for the UNIX operating system, TCP/IP software is now available for every major kind of computer operating system. To be truly on the Internet, your computer must have TCP/IP software. See also IP Address.
Internet service allowing one computer to log onto another, connecting as if not remote.
In some search tools, the terms you choose to search on can lead you to other terms you may not have thought of. Different search tools have different ways of presenting this information, sometimes with suggested words you may choose among and sometimes automatically. The terms are based on the terms in the results of your search, not on some dictionary-like thesaurus.
TITLE (of a document)
The official title of a document from the "meta" field called title. The text of this meta title field may or may not also occur in the visible body of the document. It is what appears in the top bar of the window when you display the document and it is the title that appears in search engine results. The "meta" field called title is not mandatory in HTML coding. Sometimes you retrieve a document with "No Title" as its supposed title; this is caused when the meta-title field is left blank.
In Alta Vista and some other search tools, title: search also matches on the "meta" field, which contains document descriptors not displayed on the Web. See also LIMITING TO A FIELD.
In a search, the ability to enter the first part of a keyword, insert a symbol (usually *), and accept any variant spellings or word endings, from the occurrence of the symbol forward. (E.g., femini* retrieves feminine, feminism, feminism, etc.) Which search engines have this?
Uniform Resource Locator. The unique address of any Web document. May be keyed in Netscape's OPEN or Netscape's LOCATION / GO TO box to retrieve a document. There is a logic the layout of a URL:
Anatomy of a URL:
Type of file (could say ftp:// or telnet://) Domain name (computer file is on and its location on the Internet) Path or directory on the computer to this file Name of file, and its file extension (usually ending in .html or .htm)
http:// www.lib.berkeley.edu/ TeachingLib/Guides/Internet/ FindInfo.html
Bulletinboard-like network featuring thousands of "newsgroups." For more information see the Beyond General Web Searching discussion group section.
Different word endings (such as -ing, -s, es, -ism, -ist,etc.) will be retrieved only if you allow for them in your search terms. One way to do this TRUNCATION, but few systems accept truncation. Another way is to enter the variants either separated by BOOLEAN OR (and grouped in parentheses). In +REQUIRE/-REJECT non-Boolean systems, enter the variant terms preceded with neither + nor -, because this will allow documents containing any of them to retrieved.
A variant of HTML. Stands for Extensible Hypertext Markup Language is a hybrid between HTML and XML that is more universally acceptable in Web pages and search engines than XML.
Extensible Markup Language, a dilution for Web page use of SGML (Standard General Markup Language), which is not readily viewable in ordinary browsers and is difficult to apply to Web pages. XML is very useful (among other things) for pages emerging from databases and other applications where parts of the page are standardized and must reappear many times. See XHTML.