Archie (search engine)

Archie is a tool for indexing FTP archives, allowing users to more easily identify specific files. It is considered the first Internet search engine. The original implementation was written in 1990 by Alan Emtage, then a postgraduate student at McGill University in Montreal, Canada. Archie was superseded by other, more sophisticated search engines, including Jughead and Veronica, which were search engines for the Gopher protocol. These were in turn superseded by World Wide Web search engines like AltaVista and directories like Yahoo! in 1995. Work on Archie ceased in the late 1990s. A legacy Archie server was maintained for historic purposes in Poland at Interdisciplinary Centre for Mathematical and Computational Modelling in the University of Warsaw until 2023.

Archie
Screenshot of Archie
Original authorAlan Emtage
DevelopersBunyip Information Systems, Inc.
Initial release10 September 1990; 35 years ago (1990-09-10)
Final release
3.5 / 1996
Written inC
Operating systemSolaris, AIX
TypeSearch engine
Websitebunyip.com/products/archie/ (original product page, archived)
archie.serialport.org (online instance, temporarily offline)
Internet history timeline

Early research and development:

Merging the networks and creating the Internet:

Commercialization, privatization, broader access leads to the modern Internet:

  • 2003 (2003): National LambdaRail founded
  • 2004 (2004): UN Working Group on Internet Governance (WGIG)
  • 2005 (2005): UN WSIS phase II
  • 2006 (2006): First meeting of the Internet Governance Forum
  • 2010 (2010): First internationalized country code top-level domains registered
  • 2012 (2012): ICANN begins accepting applications for new generic top-level domain names
  • 2013 (2013): Montevideo Statement on the Future of Internet Cooperation
  • 2014 (2014): NetMundial international Internet governance proposal
  • 2016 (2016): ICANN contract with U.S. Dept. of Commerce ends, IANA oversight passes to the global Internet community on October 1st

Examples of Internet services:

  • 1989 (1989): AOL dial-up service provider, email, instant messaging, and web browser
  • 1990 (1990): IMDb Internet movie database
  • 1994 (1994): Yahoo! web directory
  • 1995 (1995): Amazon online retailer
  • 1995 (1995): eBay online auction and shopping
  • 1995 (1995): Craigslist classified advertisements
  • 1995 (1995): AltaVista search engine
  • 1996 (1996): Outlook (formerly Hotmail) free web-based e-mail
  • 1996 (1996): RankDex search engine
  • 1997 (1997): Google Search
  • 1997 (1997): Babel Fish automatic translation
  • 1998 (1998): Yahoo Groups (formerly Yahoo! Clubs)
  • 1998 (1998): PayPal Internet payment system
  • 1998 (1998): Rotten Tomatoes review aggregator
  • 1999 (1999): 2ch Anonymous textboard
  • 1999 (1999): i-mode mobile internet service
  • 1999 (1999): Napster peer-to-peer file sharing
  • 2000 (2000): Baidu search engine
  • 2001 (2001): 2chan Anonymous imageboard
  • 2001 (2001): BitTorrent peer-to-peer file sharing
  • 2001 (2001): Wikipedia, the free encyclopedia
  • 2003 (2003): LinkedIn business networking
  • 2003 (2003): Myspace social networking site
  • 2003 (2003): Skype Internet voice calls
  • 2003 (2003): iTunes Store
  • 2003 (2003): 4chan Anonymous imageboard
  • 2003 (2003): The Pirate Bay, torrent file host
  • 2004 (2004): Facebook social networking site
  • 2004 (2004): Podcast media file series
  • 2004 (2004): Flickr image hosting
  • 2005 (2005): YouTube video sharing
  • 2005 (2005): Reddit link voting
  • 2005 (2005): Google Earth virtual globe
  • 2006 (2006): Twitter microblogging
  • 2007 (2007): WikiLeaks anonymous news and information leaks
  • 2007 (2007): Google Street View
  • 2007 (2007): Kindle, e-reader and virtual bookshop
  • 2008 (2008): Amazon Elastic Compute Cloud (EC2)
  • 2008 (2008): Dropbox cloud-based file hosting
  • 2008 (2008): Encyclopedia of Life, a collaborative encyclopedia intended to document all living species
  • 2008 (2008): Spotify, a DRM-based music streaming service
  • 2009 (2009): Bing search engine
  • 2009 (2009): Google Docs, Web-based word processor, spreadsheet, presentation, form, and data storage service
  • 2009 (2009): Kickstarter, a threshold pledge system
  • 2009 (2009): Bitcoin, a digital currency
  • 2010 (2010): Instagram, photo sharing and social networking
  • 2011 (2011): Google+, social networking
  • 2011 (2011): Snapchat, photo sharing
  • 2012 (2012): Coursera, massive open online courses
  • 2016 (2016): TikTok, video sharing and social networking

With assistance from the University of Warsaw, a new Archie server was created and opened for public access at The Serial Port, a web-based computer museum, on 11 May 2024.

Origin

Archie first appeared in 1986, while Emtage was the systems manager at the McGill University School of Computer Science. His predecessor had attempted to persuade the institution to connect to the Internet, but due to the expensive cost — roughly $35,000 per year for a sluggish link to Boston — it had been challenging to persuade the appropriate parties that the investment was worthwhile.

The name derives from the word "archive" without the 'v'. Emtage has said that contrary to popular belief, there was no association with the Archie Comics. Despite this, other early Internet search technologies such as Jughead and Veronica were named after characters from the comics. Anarchie, one of the earliest graphical FTP clients, was named for its ability to perform Archie searches.

Function

The earliest versions of Archie would simply search a list of public anonymous File Transfer Protocol (FTP) sites using the Telnet protocol and create index files available via FTP. To view the contents of a file, it had first to be downloaded. The indexes are updated on a regular basis (contacting each roughly once a month, so as not to waste too many resources of the remote servers) by requesting a listing. These listings were stored in local files to be searched using the Unix grep command.

The developers populated the engine's servers with databases of anonymous FTP host directories. This was used to find specific file titles since the list was plugged in to a searchable database of FTP sites. Archie did not recognize natural language requests nor index the content inside the files. Therefore, users had to know the title of the file they wanted. The ability to index the content inside the files was later introduced by Gopher.

Development

Emtage and Heelan wrote a script allowing people to log in and search collected information using the Telnet protocol at the host "archie.mcgill.ca" [132.206.2.3]. Later, more efficient front- and back-ends were developed, and the system spread from a local tool to a network-wide resource and a popular service available from multiple sites around the Internet. The collected data would be exchanged between the neighbouring Archie servers. The servers could be accessed in multiple ways: using a local client (such as archie or xarchie); telnetting to a server directly; sending queries by electronic mail; and later via a World Wide Web interface. At the peak of its popularity, the Archie search engine accounted for 50% of Montreal Internet traffic.

In 1992, Emtage, along with J. Peter Deutsch and some financial help from McGill University, formed Bunyip Information Systems with a licensed commercial version of the Archie search engine used by millions of people worldwide. Heelan followed them into Bunyip soon after, where he together with Bibi Ali and Sandro Mazzucato significantly updated the Archie database and indexed web pages. Work on the search engine ceased in the late 1990s, and the company dissolved in 2003.

See also

  • Alan Emtage
  • Jughead
  • Veronica
  • Wide area information server

Further reading

  • Archie—A Darwinian Development Process. Peter Deutsch. IEEE Internet Computing, January/February 2000, 4(1):69-71. Part of Millennial Forecasts, doi:10.1109/4236.815865.
  • P. Deutsch, A. Emtage, A. Marine, How to Use Anonymous FTP (RFC1635, May 1994)

wikipedia, wiki, encyclopedia, book, library, article, read, free download, Information about Archie (search engine), What is Archie (search engine)? What does Archie (search engine) mean?