Although creating HTML pages is easily the most time-consuming part of building your Web site, another equally important part is figuring out how you're going to get those pages on the Web. You'll need Web server software, an Internet connection, a Web URL for your pages, and a system for organizing your pages and graphics. Depending on how you gain access and how complicated your site is, just getting your first page up on the Web can take a certain amount of planning.
Before you can display your HTML pages on the Web, you'll need access to a Web server. This may already be taken care of for you, especially if you work with an Information Systems (IS) department in a larger corporation. If this is the case, you'll just need to know how and where to send your HTML files when you want to update the site. Otherwise, you'll need to make some arrangements on your own.
It isn't terribly difficult to set up your own Web server-especially if you already have a high-speed connection to the Internet. If you access the Internet through an Internet service provider (ISP), you'll want to discuss this with them, though. More than likely, they're willing to provide you with space on their Web server computers. If your Web site is a fairly small venture, or if you're not ready for a heavy investment in equipment, then using your ISP's Web server is a great (and very common) alternative.
In its essence, it's the job of a Web server to accept connections from Web browsers all over the Internet and, when requested, send them the HTML documents that are available from your site. This is done using the HTTP protocol discussed in Chapter 2.
A Web server is simply a computer with an Internet connection that runs software designed to send out HTML pages and other file formats (such as multimedia files) (see fig. 5.1). The server computer should have a relatively high-speed connection to the Internet (faster than any available modem connections, for instance) and be powerful enough to deal with a number of simultaneous connections from the Internet.
Figure 5.1: WebSTAR Web server software running on a Macintosh computer.
Web server software generally requires a fairly robust operating system (like UNIX, Windows NT, or OS/2), although software is available for other versions of Microsoft Windows, and the Macintosh OS is a very popular choice for Web server computers. The software you use depends on your level of experience with Internet connections and various operating systems.
The other major consideration is how popular your Web site will be. The more hits, or connections, your Web server receives at one time, the more powerful the computer should be-and the faster your connection to the Internet. What do I mean by a fast connection?
Most Internet connections are measured in terms of bits per second (bps), which translates loosely as "how many bits of data can be transmitted across the Internet in a second." In computerese, it takes eight bits to make up one byte of computer information-and a byte is what is required to create a character of text.
The typical modem connection is 14,400 bps, which translates to roughly 1,800 characters (bytes) transferred every second. If a typical page of text contains 300 words then, and each word averages six characters per word, this connection would yield roughly a page-per-second transmission rate. A 25-kilobyte (KB) file (such as a very small GIF file) would take about 14 seconds to transmit over this connection.
This doesn't sound terribly slow, until you start to take into
account the idea that more than one connection might occur with
the Web server. If ten people connect to our server over this
connection, it will take ten seconds to complete the task of sending
each of them a single page of data. If that page totaled 25 KB
in size (that is, if it included graphics and other elements),
it could take over 140 seconds to complete that same task.
Note |
These transmission rate numbers all reflect ideal conditions. In real life, phone line noise, traffic on the Internet, and other factors will slow down transmission rates. Throughput on a 14,400 bps connection is often somewhere between 1,100 and 1,300 characters per second. |
If the typical well-designed Web page is between 30 KB and 50 KB in size, you can see that we're going to start running into problems with this type of connection. There's the potential for someone to wait a number of minutes between the transfer of each page they request on your Web site. If the average commercial break on television is three minutes, just think how annoyed your users are going to get.
So your server will need a faster connection. But how do you get one? If Internet access is available to you through your company's Local Area Network (LAN), you probably already have a high-speed connection. Ask around your IS department. If you're running a small business or home office, you won't have to worry about high speed if you make your Web pages available on your ISP's Web server.
If you're going to use your own Web server computer, though, you'll
need a high-speed Internet connection that you can connect to
that computer. Table 5.1 details some of the possible connections.
Connection Speed | Connection Technology |
14.4/28.8 Kbps | High-speed modem |
56 Kbps | 56K leased line |
64 Kbps | Single-B-Channel ISDN |
128 Kbps | Basic Rate ISDN |
up to 1.5 Mbps | Primary Rate ISDN (U.S.) |
1.5 Mbps | T-1 dedicated line |
45 Mbps | T-3 dedicated line |
The minimum for an acceptable Web server connection is probably
a basic-rate ISDN (Integrated Services Digital Network) connection,
which offers 128,000 bps connections to the Internet. ISDN technology
uses your existing phone wiring to provide an enhanced, digital,
telephone connection. Using a special network adapter card for
your computer, you can use the ISDN line to dial an appropriately
equipped ISP. You can also use the ISDN connection for regular
telephone calls.
Note |
ISDN is a service of your local telephone company, and you should contact them for more information. Also be aware that emerging technologies such as cable modems (offered by your cable TV company) may be another high-speed alternative. |
The basic-rate ISDN connection is still somewhat slow, depending
on your Web site's traffic (that is, the number of visitors to
your site). But it's also the most reasonably priced, generally
falling between $50 and $150 a month for the ISDN line (from your
local phone company), with $50 to $100 for the ISDN account (from
your ISP), and $300 to $1,000 to purchase the ISDN equipment.
Tip |
Relatively low-cost ISDN "modems" are becoming more common for both PCs and Macs in the $300 to $500 range. |
A T-1 line is the typical connection for an ISP or a large business, and these lines generally cost thousands of dollars per month for Internet access, as do primary-rate ISDN connections. T-3 lines currently serve as the backbone of the Internet, and are generally only found connecting university, government, and supercomputing organizations.
For any sort of connection to the Internet, you'll probably need
to deal with an Internet service provider. These companies offer
dial-up and special high-speed connections to the Internet, as
well as generally offering Web and other types of Internet servers
for your use.
Note |
Looking for a provider for your Web page? With your Web browser, you can access some lists of ISPs around the country (and world) at http://thelist.com |
For the typical smaller Web site, you'll want to buy space on the ISP's Web site. Generally this will give you an URL that begins with the name of the ISP's host computer, but points to a special directory for your HTML pages, such as http://www.isp.com/username/index.html.
With most Web server programs, the default page that is first loaded is named index.html, so that's the name you'll use for the first page you'd like presented to users when they access your Web site.
If you're looking for an ISP for your Web site (as opposed to using your company's computers or your current ISP), it's important to consider two factors. Most ISPs will charge you based on how much disk space your Web site consumes and how much throughput is registered for your pages.
Throughput can be seen as the average amount of information transferred from your site to a user multiplied by the number of users who access your Web site:
average amount of information X number of users = throughput
If, for instance, each user who accesses your site transfers an average of 50 KB, and 1,000 users access your site in a month, then your throughput for that month would be 5 MB of data. If your ISP charges $1 per megabyte of throughput, you'll be charged $5 (not including the disk space charges and any monthly fees the ISP may charge).
So why charge for throughput? If hundreds of people access your
site at any given time, this means that many fewer people can
access other services provided by the ISP, so they charge you
more. Consider the scenario where everyone is downloading a 250
KB shareware program from your Web site, and over 10,000 people
access your Web site in a month. This is approximately 2.5 gigabytes
of data transferred, for which you might be charged $2,500 (at
$1 per megabyte).
Tip |
Look for Web sites that offer monthly maximums and special deals to avoid surprise bills for hundreds or thousands of dollars. |
A sum of $2,500 is a little high for that sort of traffic, but it does make a good point-many ISPs will limit your site to a certain amount of data transferred or a certain number of visitors per month (for a particular price plan). To get past these limitations, you may have to opt for the next higher plan available from the ISP, or accept additional charges for extra throughput.
Once you've decided on an ISP that you feel is reasonably priced, you're ready to create your HTML pages and upload them to the server. To do all this correctly, though, you'll probably need to ask a few questions:
Note |
Many ISPs will give you the option, at an increased price, of creating your own domain name for your site. Then users could access your site at http://www.yourname.com/. |
Tip |
When in doubt, use the DOS 8.3 filename convention in the style filename.ext where filename can be no more than eight letters, and .ext is a three-letter filename extension, such as .htm. |
The most important thing to remember when organizing a Web site is how the server computer you're using will differ from the computer you use to create Web pages. This is because you'll need to know the exact path to HTML pages and multimedia files you use in creating your Web page. As we've seen before, an URL requires both a server name and a path statement to the file. This includes files that you've placed on your own Web server-so while you're creating your Web pages, you'll need to know where your files will eventually be.
Although there are a number of different ways to arrange a Web site, there are some rules of thumb to keep in mind. For the most part, any organization you create for your Web site files should be designed to make updating your pages easy in the future. If you have to move all your files around every time you change something on a Web page, you'll also be forced to change all the hypertext links on many other pages-and that can be incredibly time-consuming.
Let's look at a couple of different types of organization for Web sites:
We've already mentioned that file extensions are an important part of all the filenames you use for your Web site. Because other Web browsers may rely on the file extension to know what sort of document or file it is, you'll need to include the appropriate extensions with all your Web site files.
Your Web site will almost always begin with a file called index.html. Most Web server software programs will automatically load this page if the URL of your site is accessed without a specific path and file reference. For example, entering http://www.sun.com/ in your browser actually results in the URL http://www.sun.com/index.html being loaded in your browser. Your Web site's first page (whether it's a "front door" page or the first page of your site) should be designed with this in mind. If you plan to offer only Netscape-enhanced pages, for instance, you'll want to let your users know this on the index.html page.
The other consideration for naming your files is the organization you plan to use for your site. If you're using a single-directory organization, your filenames should be as unique as possible, and graphics and other files should probably have names that relate to associated Web pages. For instance:
about_company.html
about_header.jpeg
about_ceo_photo.jpeg
When possible, these names will help you determine which files
are associated with which HTML pages when you go to update those
files.
Note |
Remember that it's important to know what operating system your server uses. Some of the suggestions in this section for styles of filenames will not be helpful if you're using a DOS-based server, since names are limited to the 8.3 format. |
For graphics and other files that show up on multiple pages, you might want to come up with a memorable prefix, like gen_ or site, just so you can easily replace these universal files when necessary.
To create a reasonably sized site for my home-business Web site, I'm going to use the hybrid style of organization. I have three different sections on my site: About My Business, Services, and Samples. Each of these sections will have its own directory structure. Graphics will be in their own subdirectory, as will downloadable files that I'm including (see fig. 5.2).
Figure 5.2: The directory organization for my site.
The directory names, then, will be as follows:
about_pages
service_pages
sample_pages
graphics
sample_files
Files and graphics are named for where they appear, unless they show up in multiple Web pages. For this site, the prefixes I'm using are as follows:
about_
serv_samp_
gen_
index_
By naming files in this way, I'll be able to replace any graphics or update my sample files easily-without being forced to load each file or graphic to figure out what it is. Making the names as descriptive as possible (aside from the prefix) will help too, as in the following:
about_photo_me.jpeg
samp_resume1.doc
sampl_catalog_copy.txt
If you organize your site well, updating the site is simply a matter of replacing an outdated file with a new file using the same filename. For instance, if I wanted to replace the picture of me in the previous example, I'd simply name the new file about_photo_me.jpeg, and save it in the same directory. Now the associated Web page will load the new graphic without requiring any changes to the HTML codes.
You'll need to check with your company's IS contact or your ISP to figure out exactly how you'll update files. With an ISP, you can generally use an FTP program to put new files in your directory organization on the Web site. You might instead be required to use a UNIX-based shell account for your uploading. In either case, it's a fairly simple process.
Your Web space provider will require you to enter a username and
password to gain access to the Web server, whether by FTP or shell
account. Generally, you will point your FTP server to the Web
server itself (for instance, www.isp.com), unless the provider
has created a mirror site to avoid direct access to the
Web server.
Note |
A mirror site is generally an exact replica of a Web server's hard disk, but it is kept separate for security reasons. For instance, you might not be able to directly access your company's Web site files-but you can change a mirror of that Web server, and your changes will be handled by knowledgeable Internet specialists. Many companies prefer to isolate their Web servers from their corporate network so that important data is impossible to access from outside the company. "Mirror" is more generally used to represent any more-or-less exact copy of an Internet server. The FTP site mirrors.aol.com, for instance, offers copies of nearly every shareware file available on other popular FTP servers around the world. This gives more users access to the same files at the same time. |
After clearing the security procedure, you'll most likely be in
your personal Web site's main directory. (If not, you'll need
to use the cd command in
UNIX or otherwise change directories in your FTP program.) From
that point, you can update files using the Put command. Simply
upload the updated files with the same names as the outdated files-in
nearly every case, the old files will simply be overwritten. If
you're using new files, upload them using the names and paths
that your Web page links use to refer to them.
Tip |
It's a good idea to maintain a folder or directory on your own hard drive that is as identical as possible to the Web site you make available on a server-so you can test your organization and filenames. |
Before you can start showing the world your HTML pages, you'll need to find a Web server where you can store them. This server can be a corporate server, an Internet service provider, or a computer you maintain yourself. In any case, it needs to run Web server software and have a high-speed Internet connection.
Once you've established where you're going to put your HTML files, you need to decide how you'll organize them. There are four basic ways to do this: in one directory, in directories organized by functions, in directories organized by file type, or a hybrid of the latter two. For larger sites, a hybrid is most effective.
An important part of your Web site organization is the way you name files. The best way to do this is to be as descriptive as possible, while using name prefixes that best describe what Web pages are used to access these files. This will also help immensely when it's time to troubleshoot your Web site or update some of the files.