Sitescooper Current Sites

These are the latest versions of the site files distributed with sitescooper. You can also download each site file independently from here.
Number Site File Category Site Name
1sitescooper_archive.siteadmin Sitescooper Archive
2sitescooper_changes.siteadmin Sitescooper Latest Changes
3bsdtoday.sitebsd BSD Today
4openbsd_journal.sitebsd OpenBSD Journal
5oreillynet_bsd.sitebsd O'Reilly Net BSD
6businessweek.sitebusiness BusinessWeek Online
7cnn_financial.sitebusiness CNN Financial
8fuckedcompany.sitebusiness Fucked Company
9industry_week.sitebusiness Industry Week
10motley-fool.sitebusiness The Motley Fool
11the_economist.sitebusiness Economist
12lazarus_at_large.sitebusiness Lazarus at Large
13darkhorizons.sitecinema Dark Horizons
14ebert_1min.sitecinema Roger Ebert One-Minute Reviews
15ebert_answer_man.sitecinema Roger Ebert: Movie Answer Man
16ebert_features.sitecinema Roger Ebert: Interviews-essays-festivals
17ebert_great_movies.sitecinema Roger Ebert: The Great Movies
18filthy_critic.sitecinema The Filthy Critic
19imdb_studio_briefing.sitecinema IMDB Movie/TV news
20roger_ebert.sitecinema Roger Ebert Reviews
21variety.sitecinema Variety.Com
22apartment_3g.sitecomics Apartment 3-G
23baby_blues.sitecomics Baby Blues
24barney_google_and_snuffy_smith.sitecomics Barney Google and Snuffy Smith
25beetle_bailey.sitecomics Beetle Bailey
26better_half.sitecomics The Better Half
27between_friends.sitecomics Between Friends
28blondie.sitecomics Blondie
29boondocks.sitecomics Boondocks
30buckles.sitecomics Buckles
31calvin_and_hobbes.sitecomics Calvin and Hobbes
32crock.sitecomics Crock
33curtis.sitecomics Curtis
34dennis_the_menace.sitecomics Dennis the Menace
35dilbert.sitecomics Dilbert
36dinette_set.sitecomics The Dinette Set
37doonesbury.sitecomics Doonesbury
38edge_city.sitecomics Edge City
39family_circus.sitecomics Family Circus
40flash_gordon.sitecomics Flash Gordon
41funky_winkerbean.sitecomics Funky Winkerbean
42grin_and_bear_it.sitecomics Grin and Bear It
43hagar_the_horrible.sitecomics Hagar the Horrible
44hazel.sitecomics Hazel
45henry.sitecomics Henry
46hi_and_lois.sitecomics Hi and Lois
47judge_parker.sitecomics Judge Parker
48katzenjammer_kids.sitecomics The Katzenjammer Kids
49lockhorns.sitecomics The Lockhorns
50mallard_fillmore.sitecomics Mallard Fillmore
51mandrake_the_magician.sitecomics Mandrake the Magician
52mark_trail.sitecomics Mark Trail
53marvin.sitecomics Marvin
54mary_worth.sitecomics Mary Worth
55moose_and_molly.sitecomics Moose and Molly
56mutts.sitecomics Mutts
57norm.sitecomics The Norm
58on_the_fastrack.sitecomics On The Fastrack
59phantom.sitecomics The Phantom
60piranha_club.sitecomics The Piranha Club
61popeye.sitecomics Popeye
62prince_valiant.sitecomics Prince Valiant
63redeye.sitecomics Redeye
64rex_morgan_md.sitecomics Rex Morgan M.D.
65rhymes_with_orange.sitecomics Rhymes With Orange
66safe_havens.sitecomics Safe Havens
67sally_forth.sitecomics Sally Forth
68sam_and_silo.sitecomics Sam and Silo
69shermans_lagoon.sitecomics Sherman's Lagoon
70six_chix.sitecomics Six Chix
71slylock_fox.sitecomics Slylock Fox
72spiderman.sitecomics The Amazing Spiderman
73steve_roper_and_mike_nomad.sitecomics Steve Roper and Mike Nomad
74tedrall.sitecomics Ted Rall
75theyll_do_it_every_time.sitecomics They'll Do It Every Time
76thismodernworld.sitecomics This Modern World
77tiger.sitecomics Tiger
78trudy.sitecomics Trudy
79tumbleweeds.sitecomics Tumbleweeds
80user_friendly.sitecomics User Friendly
81zippy_the_pinhead.sitecomics Zippy The Pinhead
82zits.sitecomics Zits
83world_new_york.siteculture World New York
84oracularities.sitefortune The Internet Oracle
85wingmail.sitefortune Wingmail Daily
86gamasutra_features.sitegames GamaSutra
87gamedev_net.sitegames GameDev.net
88happypenguin.sitegames Linux Game Tome
89oldmanmurray.sitegames Old Man Murray
90bofh-2k+1.sitehumor 2001: A BOFH Odyssey
91bofh-2k.sitehumor BOFH 2K: The Kit and caboodle
92bofh.sitehumor BOFH
93bofh_archive.sitehumor Bastard Operator from Hell
94dave_barry.sitehumor Dave Barry
95jon_carroll.sitehumor Jon Carroll
96pigdog.sitehumor Pigdog Journal
97satirewire.sitehumor SatireWire
98the_onion.sitehumor The Onion
99javaworld.sitelanguages JavaWorld
100merlyns_columns.sitelanguages Randal Schwartz' columns
101php_net.sitelanguages PHP.net
102use_perl.sitelanguages use Perl
103layouts.sitelib (unknown)
104zen_stories.sitelifestyle Zen stories
105advogato.sitelinux Advogato
106advogato_diaries.sitelinux Advogato Diaries
107alan_cox_diary.sitelinux Alan Cox Diary
108debian_weekly_news.sitelinux Debian Weekly News
109desktoplinux.sitelinux DesktopLinux
110footnotes.sitelinux Gnome FootNotes
111freshmeat.sitelinux Freshmeat
112gwn.sitelinux Gentoo Weekly Newsletter
113kc_kde.sitelinux KC - KDE
114kde-news.sitelinux KDE news
115kernel_cousin_debian.sitelinux Debian Kernel Cousin
116kernel_traffic.sitelinux Kernel Traffic
117kerneltrap.sitelinux kerneltrap.com
118linux_gazette.sitelinux Linux Gazette
119linux_magazine.sitelinux Linux Magazine
120linuxdevices.sitelinux LinuxDevices.com
121slashdot.sitelinux SlashDot
122a_word_a_day.sitemisc A.Word.A.Day
123drinkboy.sitemisc The Drinkboy Channel
124world_wide_words.sitemisc World Wide Words
125wired_news_business.sitenews Wired News Business
126wired_news_culture.sitenews Wired News Culture
127wired_news_politics.sitenews Wired News Politics
128wired_news_tech.sitenews Wired News Technology
129USNews.sitenews USNews-Tue
130atlantic.sitenews The Atlantic
131cnn_mobile.sitenews CNN Mobile
132newsweek.sitenews Newsweek
133newsweek_intl.sitenews NewsweekIntl-Tue
134usa_today.sitenews USA Today
135yahoo_business.sitenews Yahoo! Business
136yahoo_entertainment.sitenews Yahoo! Entertainment
137yahoo_politics.sitenews Yahoo! Politics
138yahoo_tech.sitenews Yahoo! Tech
139yahoo_top_stories.sitenews Yahoo! Top Stories
140blather.siteodd Blather
141davenet.siteopinion DaveNet
142i_cringely.siteopinion I, Cringely
143nro.siteopinion National Review Online
144pulpit.siteopinion The Pulpit
145roving_reporter.siteopinion the roving_reporter
146salon.siteopinion Salon Magazine
147suck.siteopinion Suck.com
148slate.siteopinion Slate
149alanmiller.siteopinion Alan Miller
150palm_boulevard.sitepalm Palm Boulevard
151palmpilotsoftware.sitepalm PalmPilot Software
152palmpower.sitepalm PalmPower
153palmstation.sitepalm PalmStation.Com
154visorcentral_discussion.sitepalm VisorCentral Discussion
155visorcentral_mobile.sitepalm VisorCentral Mobile
156la_lettre_edition_mobile.sitepalmsized La Lettre de l'Internet
157motley_fool.sitepalmsized The Motley Fool - News
158the_guardian_palmsized.sitepalmsized UK Guardian
159the_onion_pda.sitepalmsized The Onion
160the_register_rss.sitepalmsized The Register RSS
161inq7-mobile.sitepalmsized INQ7 mobile
162movietickets.sitepersonalized Movie Showtimes
163my_yahoo.sitepersonalized My Yahoo
164sydney_morning_herald.siteregional_australia Sydney Morning Herald
165yourmovies_canberra.siteregional_australia YourMovies Canberra
166bostonglobe.siteregional_boston Boston_Globe
167la_times_frontpage.siteregional_california LA Times Front Page
168bayarea_com_news.siteregional_california BayArea.com News
169sf_chronicle_food.siteregional_california SF Chronicle Food
170sfgate_com_news.siteregional_california SFGate.com News
171chicago_tribune_business.siteregional_chicago Trib Business
172chicago_tribune_front_page.siteregional_chicago Trib Front Page
173chicago_tribune_sports.siteregional_chicago Trib Sports
174Vecernji.siteregional_croatia Vecernji List
175accuweather_zagreb.siteregional_croatia Accuweather - Zagreb
176berlingsketidende.siteregional_denmark Berlingske
177computerworld.dk.siteregional_denmark Computerworld DK
178dmi-vejret.siteregional_denmark DMIs vejrudsigt
179geekculture.siteregional_denmark GeekCulture.dk
180ingeniøren.siteregional_denmark Ingeniøren
181politiken_daily_summary.siteregional_denmark Politiken: summary
182sslug-kalender.siteregional_denmark SSLUG kalender
183LeMonde1_INT_FRA_STE_REG.siteregional_francais Le Monde International France Société Régions
184LeMonde2_HORIZONS.siteregional_francais Le Monde HORIZONS
185LeMonde3_ENT_COM_PLA_ECO.siteregional_francais Le Monde Entreprise Communication Placements Economie
186LeMonde4_AUJ_SCI_SPO_CULT.siteregional_francais Le Monde Aujourd'hui Sciences Sports Culture
187LeMonde5_LIVRES.siteregional_francais Le Monde des Livres
188LeMonde6_Interactif.siteregional_francais Le Monde Interactif
189LeMonde7_UNE.siteregional_francais Le Monde Accueil Pierre Georges Liens
190LeMonde_AutoMoto.siteregional_francais Le Monde - AutoMoto
191journaldunet.siteregional_francais Le_Journal_du_Net
192journaldunet_dossiers.siteregional_francais Net 20
193la_tribune.siteregional_francais LaTribune
194le_monde_full.siteregional_francais le_monde_edition_electronique
195multimedium.siteregional_francais Multimédium
196nouvelobs.siteregional_francais Le Nouvel Observateur
197de_sz.siteregional_germany Sueddeutsche
198de_sz_bayern.siteregional_germany SZ Bayern
199de_sz_berlin.siteregional_germany SZ Berlin
200de_sz_beruf.siteregional_germany SZ Bildung & Beruf
201de_sz_drei.siteregional_germany SZ Seite Drei
202de_sz_feuilleton.siteregional_germany SZ Feuilleton
203de_sz_hochschule.siteregional_germany SZ Hochschulseite
204de_sz_immobilien.siteregional_germany SZ Immobilienseite
205de_sz_kultur.siteregional_germany SZ Münchner Kultur
206de_sz_literatur.siteregional_germany SZ Literatur
207de_sz_medien.siteregional_germany SZ Medien
208de_sz_meinung.siteregional_germany SZ Meinungsseite
209de_sz_muenchen.siteregional_germany SZ München
210de_sz_panorama.siteregional_germany SZ Panorama
211de_sz_politik.siteregional_germany SZ Politik
212de_sz_reise.siteregional_germany SZ Reise & Erholung
213de_sz_sonder.siteregional_germany SZ Sonderseiten
214de_sz_sonderbeilage.siteregional_germany SZ Sonderbeilage
215de_sz_sport.siteregional_germany SZ Sport
216de_sz_streiflicht.siteregional_germany SZ Streiflicht
217de_sz_verkehr.siteregional_germany SZ Auto & Verkehr
218de_sz_wirtschaft.siteregional_germany SZ Wirtschaft
219de_sz_wissen.siteregional_germany SZ Wissenschaft
220de_sz_wochenende.siteregional_germany SZ am Wochenende
221de_cert.siteregional_germany CERT RUS
222de_computerwoche.siteregional_germany Computerwoche
223de_cyberkino.siteregional_germany Cyberkino
224de_der_pocketstandard.siteregional_germany Der PocketStandard
225de_fool.siteregional_germany MotleyFool DE
226de_gazette.siteregional_germany Die Gazette
227de_gnn.siteregional_germany GNN
228de_heise.siteregional_germany Heise Newsticker
229de_heise_mobil.siteregional_germany Heise Mobil
230de_heise_tp.siteregional_germany Heise Telepolis
231de_onlinekosten.siteregional_germany Onlinekosten.de
232de_pdassi_news.siteregional_germany pdassi News
233de_pdassi_software.siteregional_germany pdassi Software
234de_spiegel.siteregional_germany Der Spiegel
235de_stern.siteregional_germany Stern
236de_tagesschau.siteregional_germany Tagesschau Mobil
237de_tecchannel.siteregional_germany tecChannel
238de_teltarif.siteregional_germany Teltarif
239de_tvspielfilm.siteregional_germany TV-Spielfilm
240de_welt.siteregional_germany Die Welt
241de_yahoo.siteregional_germany Yahoo News DE
242mobile2day.siteregional_germany mobile2day
243palmfaq_de.siteregional_germany PalmFAQ.de
244pda_debitel_net.siteregional_germany debitel.net Mobile Portal
245windows2000faq.siteregional_germany Windows2000 FAQ
246zdnet_news.siteregional_germany ZDNet News
247de_zeit.siteregional_germany Zeit
248de_zeit_alternate.siteregional_germany Zeit
249de_zeit_kultur.siteregional_germany Zeit Kultur
250de_zeit_leben.siteregional_germany Zeit Leben
251de_zeit_politik.siteregional_germany Zeit Politik
252de_zeit_reisen.siteregional_germany Zeit Reisen
253de_zeit_wirtschaft.siteregional_germany Zeit Wirtschaft
254de_zeit_wissen.siteregional_germany Zeit Wissen
255freebsd_hu.siteregional_hungary FreeBSD.hu
256hup_hu.siteregional_hungary HUP
257linux_hu.siteregional_hungary Linux.hu
258linuxforum_hu.siteregional_hungary Linuxforum
259linuxonline_hu.siteregional_hungary LinuxOnline
260metro_hu.siteregional_hungary Metro
261pdamania_hu.siteregional_hungary PDAMania.hu
262terminal_hu.siteregional_hungary Terminal.hu
263accuweather_dublin.siteregional_ireland Accuweather Dublin
264evilgerald.siteregional_ireland The Evil Gerald
265hackwatch.siteregional_ireland Hack Watch News
266irish_aertel_listings.siteregional_ireland Aertel TV Listings
267linux_ie.siteregional_ireland Linux.ie
268rte_news_online.siteregional_ireland RTE News Online
269volta_netgains.siteregional_ireland Volta NetGains
270jerusalem_post.siteregional_israel JPost
271haaretz.siteregional_israel Haaretz
272jpost-columns.siteregional_israel JPost-columns
273jpost-international.siteregional_israel JPost-international
274jpost-israel.siteregional_israel JPost-Israel
275jpost-me.siteregional_israel JPost-ME
276jpost-opinion.siteregional_israel JPost-opinion
277jp_japan_times_business.siteregional_japan Japan Times Business
278jp_japan_times_news.siteregional_japan Japan Times News
279jp_daily_yomiuri_english.siteregional_japan Daily Yomiuri English
280ny_post.siteregional_new_york New York Post
281christchurch_press.siteregional_new_zealand Christchurch Press
282gist_tv.siteregional_north_carolina GIST TV Listings
283whyytv12.siteregional_philadelphia WHYY Philadelphia TV12/91FM
284ctc-movies-metro.siteregional_philippines ClickTheCity.com - Metro Manila Movie Guide
285inq7.siteregional_philippines INQ7 Express
286seattle_p_i.siteregional_seattle Seattle P-I
287elmundo_culture.siteregional_spain El Mundo Cultura
288elmundo_economy.siteregional_spain El Mundo Economia
289elmundo_europe.siteregional_spain El Mundo Europa
290elmundo_international.siteregional_spain El Mundo Internacional
291elmundo_national.siteregional_spain El Mundo Nacional
292elmundo_society.siteregional_spain El Mundo Sociedad
293elmundo_sports.siteregional_spain El Mundo Deportes
294le_temps.siteregional_switzerland Le Temps
295globe_and_mail_columnists.siteregional_toronto G&M Columnists
296globe_and_mail_national.siteregional_toronto G&M National
297globe_and_mail_thearts.siteregional_toronto G&M The Arts
298globe_and_mail_toronto.siteregional_toronto G&M Toronto
299bbc_news_front.siteregional_uk BBC Front Page
300bbc_news_health.siteregional_uk BBC News Health
301bbc_news_sci_tech.siteregional_uk BBC News Sci-Tech
302bbc_news_world.siteregional_uk BBC World News
303the_guardian.siteregional_uk UK Guardian
304gabriels_mobile_channel.sitereligion Gabriels Channel
305scifiwire.sitesci_fi SciFi Wire
306archaeology_org.sitescience Archaeology Org News
307explorezone.sitescience ExploreZone
308grahamhancock.sitescience Hancock
309new_scientist.sitescience New Scientist
310new_scientist_news.sitescience New Scientist News
311science_daily.sitescience Science Daily Headlines
312smithsonian.sitescience Smithsonian
313spaceref.sitescience SpaceRef.com
314crypto_gram.sitesecurity Crypto-Gram
315cryptome.sitesecurity Cryptome
316GSR_Appearance_Mods.sitesport GSR Appearance Mods
317GSR_Bike.sitesport GSR Bike of The Month
318GSR_General_Disc.sitesport GSR General Discussion
319GSR_Owners.sitesport GSR Owners
320GSR_Performance_Mods.sitesport GSR Performance Mods
321GSR_Stories.sitesport GSR Stories Forum
322GSR_Technical.sitesport GSR Technical Forum
323GSR_Tips-n-Tricks.sitesport GSR Tips & Tricks
324cnn_sports.sitesport CNN Sports
325mobilebikes.sitesport MobileBikes
326yahoo_sport_news.sitesport Yahoo! Sports News
327anandtech.sitetech AnandTech
328ars_technica.sitetech Ars Technica
329computer_world.sitetech ComputerWorld
330firstmonday.sitetech First Monday
331infoworld.sitetech InfoWorld to Go
332joelonsoftware.sitetech Joel on Software
333newsforge.sitetech NewsForge
334oreillynet_features.sitetech O'ReillyNet Features
335os_opinion.sitetech OS Opinion
336pcmag_images.sitetech PCMagazine-BiWed
337risks.sitetech comp.risks
338slashdot_top.sitetech SlashDot Top
339slyck.sitetech Slyck
340techdirt.sitetech TechDirt
341the_register.sitetech The Register
342wiredmag.sitetech Wired
343xmlhack.sitetech XMLHack
344zzz.sitetech ZZZ Online
345paulgraham.sitetech Paul Graham
346pcmag_firstlooks.sitetech PCMag-1stLooks
347tvguide.sitetv TVGEN
348freshmeat_articles.siteunix Freshmeat Articles
349rootprompt.siteunix RootPrompt.org
350samba_traffic.siteunix Kernel Traffic - Samba
351wine_traffic.siteunix KC - Wine
352iwin.siteweather IWIN Weather
353nrcc_northeast_forecast.siteweather NRCC Forecasts for Northeastern US
354wu_new_mexico.siteweather Weather - New Mexico
355wu_redmond.siteweather Weather - Redmond
356alertbox.siteweb Alertbox
357jon_udell.siteweb Jon Udells Articles
358mappa_mundi.siteweb Mappa.Mundi
359mozillazine.siteweb MozillaZine
360researchbuzz.siteweb ResearchBuzz
361searchenginereport.siteweb Search Engine Report
362bifurcated_rivets.siteweblog Bifurcated Rivets
363boingboing.siteweblog Boing Boing
364camworld.siteweblog CamWorld
365crummy.siteweblog Crummy
366doc_searls.siteweblog Doc Searls Weblog
367eckes.siteweblog Eckes.org - Opinions of some Geeks
368ethel_the_blog.siteweblog Ethel The Blog
369flutterby.siteweblog Flutterby
370genehack.siteweblog GeneHack
371hack_the_planet.siteweblog Hack The Planet
372honeyguide.siteweblog Honeyguide
373jason_pettus.siteweblog Jason Pettus
374memepool.siteweblog Memepool
375monkeyfist.siteweblog Monkeyfist
376mydog.siteweblog my dog wants to be on the radio
377ntk.siteweblog NTKnow
378peterme.siteweblog PeterMe
379rathergood.siteweblog rathergood.com
380rc3.siteweblog RC3
381riverbend.siteweblog Riverbend
382robot_wisdom.siteweblog Robot Wisdom
383scripting_news.siteweblog Scripting News
384tim_oreilly.siteweblog Tim O'Reilly's Weblog
385tomalaks_realm.siteweblog Tomalaks Realm
386where_is_raed.siteweblog WhereIsRaed
387kevin_sites.siteweblog Iraq War Blog - K. Sites

Category: CVS


Category: admin


sitescooper_archive.site:

# This is a sitescooper site file. see http://sitescooper.tsx.org/
# by Stefan Schwingeler, Version 0.1, 09.11.1999
# Thanks Stefan!
#
URL: http://groups.yahoo.com/group/sitescooper-archive/
  Name: Sitescooper Archive
  Levels: 2
  ContentsStart: <font.*>date</font>
  ContentsEnd: </table>
  ContentsCachable: 0
  StoryURL: http://groups.yahoo.com/group/sitescooper-archive/\d+\.html\?
  StoryStart: Subject: 
  StoryEnd: alt="Previous"
  StoryCacheable: 1
# rm center
  StoryPostProcess: {
    s/v?align=center//gim;
  }


sitescooper_changes.site:

URL: http://sitescooper.org/devel/LATEST_CHANGES.html
Name: Sitescooper Latest Changes
Description: the Sitescooper development change log

Levels: 1
StoryDiff: 1
UseTableSmarts: 0
TableRender: flatten


Category: bsd


bsdtoday.site:

URL:		http://www.bsdtoday.com/
Name:		BSD Today
Description:	Your Daily Source for BSD News and Information
Levels:		2
UseTableSmarts:	0
TableRender:	flatten

ContentsStart:	<img src="/images/black.gif" width="1" height="550">
ContentsEnd:	<b>Resources</b><br>

StoryURL:	http://www.bsdtoday.com/\d+/.*\d+.html
StoryStart:	<img src="/images/black.gif" width="1" height="550">
StoryEnd:	<b>Please share your comments.</b>


openbsd_journal.site:

URL: http://undeadly.org/
Name: OpenBSD Journal
Levels: 2

AuthorName: Barry Dexter A. Gonzaga
AuthorEMail: barryg-sitescooper /at/ kssp.upd.edu.ph

StoryURL: .*action=article.*
StoryToPrintableSub: s/(sid=\d+$)/\1\&mode=flat/
ContentsStart: About :
ContentsEnd: <b>Features</b>


oreillynet_bsd.site:

URL: http://www.onlamp.com/bsd/
Name: O'Reilly Net BSD
Levels: 2

ContentsStart: --  BSD Lede  --
ContentsEnd: --  digest  --

StoryURL: /pub/a/bsd/[[YYYY]]/\d+/\d+/\S+.html(|\?page=\d+)
StoryStart: --  content here  --
StoryEnd: --  footer area  --
StoryFollowLinks: 1


Category: business


businessweek.site:

URL: http://pda.businessweek.com/index.htm
Name: BusinessWeek Online
Levels: 3

AuthorName: Barry Dexter A. Gonzaga
AuthorEMail: barryg-sitescooper /at/ kssp.upd.edu.ph

StoryURL: .*\.(htm|html)
StoryURL: /list/.*\.htm
StoryURL: /.*/.*/.*/.*\.htm
StorySkipURL: /ads/contents.htm
ImageURL: /common_images/.*\.gif


cnn_financial.site:

# CNN Financial
URL: http://wireless.cnn.com/avantgo/CNNMONEY/en/channel.html
# created from PODS file by David A. Desrosiers
AuthorName: Marko Bozikovic <marko.bozikovic /at/ envox.hr>
Name: CNN Financial
Levels: 2

ImageURL: .*\.gif
ImageScaleToMaxWidth: 150
ContentsCachable: 0

StoryURL: http://wireless.cnn.com/avantgo/CNNMONEY/en/stories/.*
StoryCachable: 1



fuckedcompany.site:

# AuthorName: jm
#
# I love this site, just for the author's pure schadenfreude!
#
URL: http://www.fuckedcompany.com/
Name: Fucked Company
Description: the dot-com deadpool

StoryStart: <img src="images/recent_groove.gif" width=402 height=2></td>
StoryEnd: <td>&nbsp;&nbsp;<a href="archives">View more headlines</a></td>


industry_week.site:

URL: http://www.industryweek.com/avantgo/
Name: Industry Week
Levels: 2
ContentsPrint: 1
ImageURL: http://.*
#
# This site was converted from an AvantGo .subs file by subs-to-site.pl.
# See http://sitescooper.org/ for more information on sitescooper.


motley-fool.site:

URL:		http://www.fool.com/xml/foolnews_rss091.xml
Name:		The Motley Fool
Description:	To Educate, Amuse, and Enrich
ContentsFormat:	rss

StoryURL:	/.*\.htm
StoryEnd: <A NAME="NUMBERS">
StoryStart: <BODY 

# as dictated in http://www.fool.com/help/FoolsRules.htm
Rights:		Copyright 1996-2000 The Motley Fool. All rights reserved.

MinPages:	2


the_economist.site:

URL: http://www.economist.com/index.html?nonNA=1
Name: Economist
Description: Economist
AuthorName: Goh Boon Nam
# Version 1.2
# Date updated : 30 Dec 2004
# Changes made : Change of URL + Remove Subscription-only pages which cause problem to Plucker

Levels: 2

ContentsStart: <td colspan="7" width="447" valign="top">
ContentsEnd: Only one answer is correct

ContentsUseTableSmarts: 0

StoryToPrintableSub: s!displayStory.cfm!PrinterFriendly.cfm!

StoryURL: http://www.economist.com/(.*?)/PrinterFriendly.cfm(.*?)

#This image is the icon to indicate story not available
ImageURL: http://www.economist.com/images/dingbats/e5.gif

ContentsHTMLPreProcess: {
    s/align="right"//gim;
    s/align="center"//gim;
    s/align=right//gim;   
    s/align=center//gim;
  }

StoryHTMLPreProcess: {
    s/align="right"//gim;
    s/align="center"//gim;
    s/align=right//gim;   
    s/align=center//gim;
    s/<div id="wholepage" style="visibility: hidden">(.*?)<\/noscript>//gis;
   }





lazarus_at_large.site:

# site_samples/business/lazarus_at_large.site
#
# SF Chronicle Columnists : David Lazarus, "Lazarus at Large"
# by Akkana Peck

URL:		http://sfgate.com/cgi-bin/search/columnists.cgi?waisdbname=/chronicle/&byline=David+Lazarus
Name:		Lazarus at Large
Levels:		2

ContentsStart: <INPUT TYPE="submit" VALUE="View Archive">
ContentsDiff: 1

StoryURL: http://sfgate.com/cgi-bin/article.cgi.*

StoryStart: <!-- end #additionalcontent -->
StoryEnd: <!-- END STORY -->


Category: cinema


darkhorizons.site:

# Author: MMiller /at/ media-general.com

URL: http://www.darkhorizons.com/news-n.htm
  Name: Dark Horizons
  Levels: 1
  ContentsStart: UPDATE:
  ContentsEnd: HR WIDTH=40%
  StoryCacheable: 0
  ContentsDiff: 0


ebert_1min.site:

# roger_ebert_1min.site
# AuthorName: Alan Hoyle <alan@alanhoyle.com>
#
# Reads the Roger Ebert One Minute Movie Reviews

URL: http://www.suntimes.com/output/minmovie/movie0.html
Name: Roger Ebert One-Minute Reviews
Description: Roger Ebert's One-Minute Movie Reviews
Levels: 2
Category: Daily

ContentsStart: Begin Content
ContentsEnd:  End Content
StoryURL: .*ebert_reviews/.*\.html
StoryCacheable: 1

StoryHeadline: <h2>(.*)</h2>
StoryStart: Begin Review
StoryEnd: End Content

Rights: Copyright &copy; Chicago Sun-Times Inc.


ebert_answer_man.site:

# ebert_answer_man.site
# Roger Ebert's Movie Answer Man weekly Q&A column

URL: http://www.suntimes.com/index/answ-man.html
Name: Roger Ebert: Movie Answer Man
Description: Roger Ebert's Movie Answer Man weekly Q&A column
Levels: 2

ContentsStart: <!-- Begin Content -->
StoryURL: .*answ-man/.*\.html
StoryCacheable: 1

StoryHeadline: <h2>(.*)</h2>
StoryStart: <!-- Begin Content --> 
StoryEnd: <!-- End Content -->


ebert_features.site:

# ebert_features.site
# Roger Ebert's Movie Feature Articles

URL: http://www.suntimes.com/index/ebfeatures.html
Name: Roger Ebert: Interviews-essays-festivals
Description: Roger Ebert's movie feature articles
Levels: 2

ContentsStart: <!-- Begin Content -->
StoryURL: .*eb-feature/.*\.html
StoryCacheable: 1

StoryHeadline: <h2>(.*)</h2>
StoryStart: <!-- Begin Content --> 
StoryEnd: <!-- End Content -->


ebert_great_movies.site:

# ebert_great_movies.site
# Roger Ebert's "The Great Movies"

URL: http://www.suntimes.com/ebert/greatmovies/index.html
Name: Roger Ebert: The Great Movies
Description: Roger Ebert's regular "The Great Movies" feature
Levels: 2

ContentsStart: <!-- Begin Content -->
ContentsDiff: 1
StoryURL: .*ebert/greatmovies/.*\.html
StoryCacheable: 1

StoryHeadline: <h[12]>(.*)</h[12]>
StoryStart: <!-- Begin Content --> 
StoryEnd: <!-- End Content -->


filthy_critic.site:

URL: http://bigempire.com/filthy/
Name: The Filthy Critic
Levels: 1

StoryStart: <TD WIDTH="440" VALIGN="TOP">
StoryEnd: </HTML>


imdb_studio_briefing.site:

# IMDB.com Movie/TV news
# Author: Jan Lund Thomsen <kwed@kwed.org>

URL:		http://us.imdb.com/StudioBrief/
Name:		IMDB Movie/TV news
Levels:		1

AuthorName:     Jan Lund Thomsen
AuthorEmail:    kwed@kwed.org

StoryStart: <!-studiodate -->
StoryEnd: <A HREF="mailto:studiobrf@aol.com">Studio Briefing</A> Edited by <A HREF="http://members.aol.com/studiobrf/lewirwin/lewsbio.html">Lew Irwin</A> 


roger_ebert.site:

# roger_ebert.site
# AuthorName: Justin Henry <jhenry@fjicl.com>
# Modified:  Alan Hoyle <alan /at/ alanhoyle.com>
#
# Modified to read the Ebert review index page, and to deal
# with a new SunTimes page format  
# Modified to exclude extraneous bottom of page stuff.

URL: http://www.suntimes.com/index/ebert1.html
Name: Roger Ebert Reviews
Description: Roger Ebert's Movie Reviews
Levels: 2
Category: Daily

ContentsStart: <!-- Begin Content -->
ContentsEnd:  End Content
StoryURL: .*ebert1/.*\.html
StoryCacheable: 1

StoryHeadline: <h[12]>(.*)</h[12]>
StoryStart: <!-- Begin Content -->
StoryEnd: End Content


variety.site:

URL: http://www.variety.com/channel
Name: Variety.Com
Levels: 2
ContentsPrint: 1
ImageURL: http://.*
#
# This site was converted from an AvantGo .subs file by subs-to-site.pl.
# See http://sitescooper.org/ for more information on sitescooper.


Category: comics


apartment_3g.site:

URL: http://www.kingfeatures.com/features/comics/apt3g/aboutMaina.php
Name: Apartment 3-G
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Apartment_3-G.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


baby_blues.site:

URL: http://www.kingfeatures.com/features/comics/babyblue/aboutMaina.php
Name: Baby Blues
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Baby_Blues.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


barney_google_and_snuffy_smith.site:

URL: http://www.kingfeatures.com/features/comics/bgoogle/aboutMaina.php
Name: Barney Google and Snuffy Smith
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Barney_Google.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


beetle_bailey.site:

URL: http://www.kingfeatures.com/features/comics/bbailey/aboutMaina.php
AuthorName: Marko Bozikovic <marko.bozikovic /at/ envox.hr>
Name: Beetle Bailey
StoryStart: <!--CMS NAME="image"-->
StoryEnd: by <!--CMS NAME="author"
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Beetle_Bailey.*
ImageScaleToMaxWidth: 500


better_half.site:

URL: http://www.kingfeatures.com/features/comics/bethalf/aboutMaina.php
Name: The Better Half
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Better_Half.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


between_friends.site:

URL: http://www.kingfeatures.com/features/comics/bfriends/aboutMaina.php
Name: Between Friends
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Between_Friends.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


blondie.site:

URL: http://www.kingfeatures.com/features/comics/blondie/aboutMaina.php
Name: Blondie
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Blondie.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


boondocks.site:


URL: http://www.ucomics.com/boondocks/
  AuthorName: Ignatz Sol [iggy /at/ mechanolatry.com]
  Name: Boondocks
  StoryStart: <!--- comics view content --->
  StoryEnd: <!--calendar-->
  StoryDiff: 1
  ImageOnlySite: 1
  ImageURL: http://images.ucomics.com/comics/bo/200\d/bo.*
  #ImageScaleToMaxWidth: 450
  UseTableSmarts: 0


buckles.site:

URL: http://www.kingfeatures.com/features/comics/buckles/aboutMaina.php
Name: Buckles
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Buckles.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


calvin_and_hobbes.site:

URL: http://www.ucomics.com/calvinandhobbes/viewch.cfm
  AuthorName: Marko Bozikovic <redbyron /at/ fly.srk.fer.hr> modified by Gary Paulson
  Name: Calvin and Hobbes

  StoryStart: <!-- end comic nav -->
  # ?did not work? StoryStart: \gtimg src="http://a828.g.akamai.net
  StoryEnd: <!--calendar-->

  ImageOnlySite: 1
  ImageURL: .*/ch/\d\d\d\d/ch.*\.gif
  ImageScaleToMaxWidth: 550

  StoryHTMLPreProcess: {
     s!<a href..http.//www.ucomics.com/shopping/buycomic.cfm.uc_fn=1.uc_full_date=\d+?.uc_daction.X.uc_comic=ch.>!!gsi;
   }


crock.site:

URL: http://www.kingfeatures.com/features/comics/crock/aboutMaina.php
Name: Crock
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Crock.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


curtis.site:

URL: http://www.kingfeatures.com/features/comics/curtis/aboutMaina.php
Name: Curtis
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Curtis.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


dennis_the_menace.site:

URL: http://www.kingfeatures.com/features/comics/dennis/aboutMaina.php
AuthorName: Marko Bozikovic <marko.bozikovic /at/ envox.hr>
Name: Dennis the Menace
StoryStart: <!--CMS NAME="image"-->
StoryEnd: by <!--CMS NAME="author"
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Dennis_The_Menace.*
ImageScaleToMaxWidth: 500


dilbert.site:


URL: http://www.dilbert.com/comics/dilbert/
  AuthorName: Kevin L. Dupree <kdupree /at/ flash.net>
  Name: Dilbert
  StoryStart: COMIC STRIP BEGIN
  StoryEnd: COMIC STRIP END
  StoryDiff: 1
  ImageOnlySite: 1
  ImageURL: http://www.dilbert.com/comics/dilbert/archive/images/.*
  ImageScaleToMaxWidth: 450
  UseTableSmarts: 0

  # add size info so sitescooper knows to make it into a
  # link for Plucker.
  StoryHTMLPreProcess: {
    s/ALT="Today.s Dilbert Comic"/
    	ALT="Today.s Dilbert Comic" WIDTH=600 HEIGHT=211
    /gs;
  }


dinette_set.site:

URL: http://www.kingfeatures.com/features/comics/dinette/aboutMaina.php
Name: The Dinette Set
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Dinette_Set.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


doonesbury.site:


URL: http://www.doonesbury.com/strip/dailydose/index.html
  AuthorName: Ignatz Sol [iggy /at/ mechanolatry.com]
  Name: Doonesbury
  StoryStart: no next date
  StoryEnd: dose_feature_menu4_01.gif
  StoryDiff: 1
  ImageOnlySite: 1
  ImageURL: http://images.ucomics.com/comics/db/200\d/db.*
  #ImageScaleToMaxWidth: 500
  UseTableSmarts: 0


edge_city.site:

URL: http://www.kingfeatures.com/features/comics/edgecity/aboutMaina.php
Name: Edge City
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Edge_City.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


family_circus.site:

URL: http://www.kingfeatures.com/features/comics/familyc/aboutMaina.php
Name: Family Circus
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://www.kingfeatures.com/features/comics/familyc/fct.*\.gif
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ rustymail.com


flash_gordon.site:

URL: http://www.kingfeatures.com/features/comics/fgordon/aboutMaina.php
Name: Flash Gordon
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://www.kingfeatures.com/features/comics/fgordon/fg.*\.gif
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


funky_winkerbean.site:

URL: http://www.kingfeatures.com/features/comics/fwinker/aboutMaina.php
Name: Funky Winkerbean
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Funky_Winkerbean.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


grin_and_bear_it.site:

URL: http://www.kingfeatures.com/features/comics/grinbear/aboutMaina.php
Name: Grin and Bear It
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Grin_and_Bear_It.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


hagar_the_horrible.site:

URL: http://www.kingfeatures.com/features/comics/hagar/aboutMaina.php
AuthorName: Marko Bozikovic <marko.bozikovic /at/ envox.hr>
Name: Hagar the Horrible
StoryStart: <!--CMS NAME="image"-->
StoryEnd: by <!--CMS NAME="author"
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Hagar_The_Horrible.*
ImageScaleToMaxWidth: 450



hazel.site:

URL: http://www.kingfeatures.com/features/comics/hazel/aboutMaina.php
Name: Hazel
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://www.kingfeatures.com/features/comics/hazel/hat.*\.gif
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


henry.site:

URL: http://www.kingfeatures.com/features/comics/henry/aboutMaina.php
Name: Henry
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://www.kingfeatures.com/features/comics/henry/het.*\.gif
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


hi_and_lois.site:

URL: http://www.kingfeatures.com/features/comics/hi_lois/aboutMaina.php
Name: Hi and Lois
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Hi_and_Lois.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


judge_parker.site:

URL: http://www.kingfeatures.com/features/comics/jparker/aboutMaina.php
Name: Judge Parker
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Judge_Parker.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


katzenjammer_kids.site:

URL: http://www.kingfeatures.com/features/comics/katzkids/aboutMaina.php
Name: The Katzenjammer Kids
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://www.kingfeatures.com/features/comics/katzkids/kk.*\.jpg
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


lockhorns.site:

URL: http://www.kingfeatures.com/features/comics/lockhorn/aboutMaina.php
Name: The Lockhorns
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Lockhorns.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


mallard_fillmore.site:

URL: http://www.kingfeatures.com/features/comics/mallard/aboutMaina.php
Name: Mallard Fillmore
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Mallard_Fillmore.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


mandrake_the_magician.site:

URL: http://www.kingfeatures.com/features/comics/mandrake/aboutMaina.php
Name: Mandrake the Magician
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://www.kingfeatures.com/features/comics/mandrake/mmt.*\.gif
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


mark_trail.site:

URL: http://www.kingfeatures.com/features/comics/mtrail/aboutMaina.php
Name: Mark Trail
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Mark_Trail.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


marvin.site:

URL: http://www.kingfeatures.com/features/comics/marvin/aboutMaina.php
Name: Marvin
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Marvin.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


mary_worth.site:

URL: http://www.kingfeatures.com/features/comics/mworth/aboutMaina.php
Name: Mary Worth
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Mary_Worth.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


moose_and_molly.site:

URL: http://www.kingfeatures.com/features/comics/moosemol/aboutMaina.php
Name: Moose and Molly
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://www.kingfeatures.com/features/comics/moosemol/mot.*\.gif
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


mutts.site:

URL: http://www.kingfeatures.com/features/comics/mutts/aboutMaina.php
Name: Mutts
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Mutts.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


norm.site:

URL: http://www.kingfeatures.com/features/comics/thenorm/aboutMaina.php
Name: The Norm
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Norm.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


on_the_fastrack.site:

URL: http://www.kingfeatures.com/features/comics/fastrack/aboutMaina.php
Name: On The Fastrack
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Fast_Track.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


phantom.site:

URL: http://www.kingfeatures.com/features/comics/phantom/aboutMaina.php
Name: The Phantom
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Phantom.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


piranha_club.site:

URL: http://www.kingfeatures.com/features/comics/piranha/aboutMaina.php
Name: The Piranha Club
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Piranha.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


popeye.site:

URL: http://www.kingfeatures.com/features/comics/popeye/aboutMaina.php
AuthorName: Marko Bozikovic <redbyron /at/ fly.srk.fer.hr>
Name: Popeye
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Popeye.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean (revision)
AuthorEmail: yoonfui /at/ bigfoot.com


prince_valiant.site:

URL: http://www.kingfeatures.com/features/comics/pvaliant/aboutMaina.php
Name: Prince Valiant
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://www.kingfeatures.com/features/comics/pvaliant/val.*\.gif
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


redeye.site:

URL: http://www.kingfeatures.com/features/comics/redeye/aboutMaina.php
Name: Redeye
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Redeye.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


rex_morgan_md.site:

URL: http://www.kingfeatures.com/features/comics/rmorgan/aboutMaina.php
Name: Rex Morgan M.D.
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Rex_Morgan.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


rhymes_with_orange.site:

URL: http://www.kingfeatures.com/features/comics/orange/aboutMaina.php
Name: Rhymes With Orange
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Rhymes_with_Orange.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com




safe_havens.site:

URL: http://www.kingfeatures.com/features/comics/safehavn/aboutMaina.php
Name: Safe Havens
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Safe_Havens.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


sally_forth.site:

URL: http://www.kingfeatures.com/features/comics/sforth/aboutMaina.php
Name: Sally Forth
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Sally_Forth.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


sam_and_silo.site:

URL: http://www.kingfeatures.com/features/comics/sam_silo/aboutMaina.php
Name: Sam and Silo
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://www.kingfeatures.com/features/comics/sam_silo/sst.*\.gif
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


shermans_lagoon.site:

URL: http://www.kingfeatures.com/features/comics/lagoon/aboutMaina.php
Name: Sherman's Lagoon
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Shermans_Lagoon.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


six_chix.site:

URL: http://www.kingfeatures.com/features/comics/sixchix/aboutMaina.php
Name: Six Chix
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/6Chix.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


slylock_fox.site:

URL: http://www.kingfeatures.com/features/comics/slylock/aboutMaina.php
Name: Slylock Fox
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Slylock.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


spiderman.site:

URL: http://www.kingfeatures.com/features/comics/spidermn/aboutMaina.php
AuthorName: Marko Bozikovic <marko.bozikovic /at/ envox.hr>
Name: The Amazing Spiderman
StoryStart: <!--CMS NAME="image"-->
StoryEnd: by <!--CMS NAME="author"
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Spiderman.*
ImageScaleToMaxWidth: 500


steve_roper_and_mike_nomad.site:

URL: http://www.kingfeatures.com/features/comics/sroper/aboutMaina.php
Name: Steve Roper and Mike Nomad
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Steve_Roper.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


tedrall.site:


URL: http://www.ucomics.com/rallcom/
  AuthorName: Ignatz Sol [iggy /at/ mechanolatry.com]
  Name: Ted Rall
  StoryStart: no next date
  StoryEnd: Get Ted Rall by e-mail
  StoryDiff: 1
  ImageOnlySite: 1
  ImageURL: http://images.ucomics.com/comics/trall/200\d/tr.*
  #ImageScaleToMaxWidth: 450
  UseTableSmarts: 0


theyll_do_it_every_time.site:

URL: http://www.kingfeatures.com/features/comics/theydoit/aboutMaina.php
Name: They'll Do It Every Time
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/TDIE.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


thismodernworld.site:

URL: http://www.thismodernworld.com/
  Name: This Modern World
  Description: This Modern World by Tom Tomorrow
  Levels: 1
  StoryDiff: 1
  # thx to Adrian Colley <aecolley AT spamcop net>


tiger.site:

URL: http://www.kingfeatures.com/features/comics/tiger/aboutMaina.php
Name: Tiger
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Tiger.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


trudy.site:

URL: http://www.kingfeatures.com/features/comics/trudy/aboutMaina.php
Name: Trudy
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://www.kingfeatures.com/features/comics/trudy/trt.*\.gif
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


tumbleweeds.site:

URL: http://www.kingfeatures.com/features/comics/tumblewd/aboutMaina.php
Name: Tumbleweeds
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Tumbleweeds.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


user_friendly.site:


URL: http://www.userfriendly.org/static/
  AuthorName: Kevin L. Dupree <kdupree /at/ flash.net>
  Name: User Friendly
  StoryStart: <!--Start Current Strip-->
  StoryEnd: <!--End Strip-->
  StoryDiff: 1
  ImageOnlySite: 1
  ImageURL: /cartoons/archives/.*\.gif
  ImageScaleToMaxWidth: 550


zippy_the_pinhead.site:

URL: http://www.kingfeatures.com/features/comics/zippy/aboutMaina.php
Name: Zippy The Pinhead
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Zippy_the_Pinhead.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


zits.site:

URL: http://www.kingfeatures.com/features/comics/zits/aboutMaina.php
Name: Zits
StoryStart: <!--CMS NAME="image"-->
StoryEnd: <!--/CMS-->
StoryDiff: 1
ImageOnlySite: 1
ImageURL: http://est.rbma.com/content/Zits.*
ImageScaleToMaxWidth: 500

AuthorName: Yoon Fui Thean
AuthorEmail: yoonfui /at/ bigfoot.com


Category: culture


world_new_york.site:

URL: http://www.worldnewyork.net/
Name: World New York
Description: Links to, and extracts from, quality writing on the web

Levels: 2
ContentsStart: <!-- Weblog entries -->
ContentsEnd: !-- Link to RSS Syndication page -->
StoryURL: http://www.worldnewyork.net/comments.php\?id=\S+

StoryStart: <div class="lgbody"><p>
StoryEnd: <h4>COMMENTS</h4>

# site file author details
AuthorName: Justin Mason
AuthorEmail: jm@jmason.org

# This site gets bonus points for linking to the palm version as the
# "AvantGo/SiteScooper/Palm Version" in its early days ;)


Category: fortune


oracularities.site:

URL: http://www.cs.indiana.edu/hyplan/oracle/latest.html
Name: The Internet Oracle

Levels: 2
ContentsStart: <body>
ContentsEnd: </body>

StoryURL: http://www.cs.indiana.edu/hyplan/oracle/digests/.*\.html
StoryStart: <body>
StoryEnd: </body>

StoryHTMLPreProcess: {
  s/<form/<ignore/g; s/<\/form>/<\/ignore>/g;
}

MinPages: 2


wingmail.site:

# From: artwells <artwells@artwells.com>

URL:		http://www.artwells.com/oracula/web/serve-wing.php?wingrequest=[[YYYY]][[MM]][[DD]]
Name:		Wingmail Daily
Levels:		1


Category: games


gamasutra_features.site:

URL: http://www.gamasutra.com/features/
  Name: GamaSutra
  Levels: 2
  ContentsStart: .BeginEditable "content"
  ContentsEnd: -- .BeginLibraryItem "/Library/.*_footer.lbi" --
  StoryStart: -- .BeginEditable "main.20content"
  StoryEnd: -- .BeginLibraryItem "/Library/.*_footer.lbi" --

  # We only read linked stories for features, not for newswire items.
  # ah shaggit, let's get the newswires too.
  StoryURL: .*(features|newswire)/.*\.htm.*

  # Need to follow links into other story pages
  StoryFollowLinks: 1
  StoryHeadline: <title>Gamasutra - \S+ - (.*?) \[.*?\]\s*</title>


gamedev_net.site:

URL:		http://www.gamedev.net/xml/
Name:		GameDev.net
Description:	Maximum Game Development!
ContentsFormat:	rss

StoryURL:	/info/news/fullstory.asp.*


happypenguin.site:

URL: http://www.happypenguin.org/news
Name: Linux Game Tome
Description: The latest Linux game news

Levels: 2

ContentsStart: <form method="GET" action="http://happypenguin.org/news">
ContentsEnd: <a href="http://happypenguin.org/news?start=10">

StoryURL: http://.*happypenguin.org/show.*
StoryStart: <tr bgcolor=#000080><td width="20" valign=top align=left><img src="http://happypenguin.org/images/tl.gif" width=20 height=20 alt=""></td>
StoryEnd: </HTML>

ContentsUseTableSmarts: 0
StoryUseTableSmarts: 0
TableRender: flatten


oldmanmurray.site:

URL: http://www.oldmanmurray.com/
Name: Old Man Murray
Description: Game news and reviews with a thoroughly nasty flavour

TableRender: flatten
Levels: 2

ContentsStart: Make sure to check to the left for all the latest on OldManMurray.com</SMALL></TD>

StoryURL: http://www.oldmanmurray.com/(features|shortreviews|longreviews|seanbaby)/.*html.*
StoryURL: http://www.oldmanmurray.com/realnews.wcs
StoryFollowLinks: 1

StoryStart: src="http://www.oldmanmurray.com/logoimages/ugologo\S+.gif"


Category: humor


bofh-2k+1.site:

URL: http://www.theregister.co.uk/content/30/25244.html
  Name: 2001: A BOFH Odyssey
  Description: Bastard Operator From Hell: 2001 Edition
  AuthorName: Barry Dexter A. Gonzaga
  AuthorEmail: barryg /at/ kssp.upd.edu.ph

  Levels: 2
  StoryURL: /content/archive/\d+\.html
  ContentsStart: <HR>
  ContentsEnd: <BR></DIV>.<DIV><IMG.SRC=
  StoryStart: <HR>
  StoryEnd: <BR></DIV>.<DIV><IMG.SRC=

  StoryHTMLPreProcess: {
	s/<DIV CLASS=.storyhead.>(.*?)<\/DIV>/<H2>$1<\/H2>/is;
	s/<DIV CLASS=.storybyline.>(.*?)<\/DIV>/<H3>$1<\/H3>/is;
	s/<DIV CLASS=.indexposted.>(.*?)<\/DIV>/<H3>$1<\/H3>/is;
	s/<DIV CLASS=.storybody.><b>(.*?)<\/b>/<H4>$1<\/H4>/is;
	s/<br>.<br>(.*?)<br>.<br>/<\/p><p>$1<\/p><p>/gs;
  }

  StoryPostProcess: {
	s/<b><b>//is;
	s/<i><i>//is;
	s/<\/H4>.<\/p>/<\/H4>/is;
  }


bofh-2k.site:

URL: http://www.theregister.co.uk/content/30/15804.html
  Name: BOFH 2K: The Kit and caboodle
  Description: Bastard Operator From Hell: 2000 Edition
  AuthorName: Barry Dexter A. Gonzaga
  AuthorEmail: barryg /at/ kssp.upd.edu.ph

  Levels: 2
  StoryURL: /content/\d+/\d+\.html
  ContentsStart: <HR>
  ContentsEnd: <BR></DIV>.<DIV><IMG.SRC=
  StoryStart: <HR>
  StoryEnd: <BR></DIV>.<DIV><IMG.SRC=

  StoryHTMLPreProcess: {
	s/<DIV CLASS=.storyhead.>(.*?)<\/DIV>/<H2>$1<\/H2>/is;
	s/<DIV CLASS=.storybyline.>(.*?)<\/DIV>/<H3>$1<\/H3>/is;
	s/<DIV CLASS=.indexposted.>(.*?)<\/DIV>/<H3>$1<\/H3>/is;
	s/<DIV CLASS=.storybody.><b>(.*?)<\/b>/<H4>$1<\/H4>/is;
	s/<br>.<br>(.*?)<br>.<br>/<\/p><p>$1<\/p><p>/gs;
  }

  StoryPostProcess: {
	s/<b><b>//is;
	s/<i><i>//is;
	s/<\/H4>.<\/p>/<\/H4>/is;
  }


bofh.site:

# Bastard Operator from Hell
URL: http://www.theregister.co.uk/content/30/index.html
  Name: BOFH
  Levels: 2
  StoryURL: /content/\d+/\d+\.html
  StoryCacheable: 1
  MinPages: 2
  StoryUseTableSmarts: 0
  ContentsUseTableSmarts: 0
  ContentsStart: <IFRAME SRC=.http://ad.uk.doubleclick.net/
  ContentsEnd: <TD WIDTH="150" ALIGN="right" VALIGN="top">

  StoryHTMLPreProcess: {
    s/<DIV CLASS=.story_head.>(.*?)<\/DIV>/<H2 CLASS='story_head'>$1<\/H2>/is;
    s/<br>.<br><B>Related (?:[sS]tory|[sS]tories|[lL]ink|[lL]inks)<\/B>.*\Z//s;
    s/<br>+/<br>/i;
    s/<br><p>(?:<br>)*/<p>/i;
  }
  MinPages: 2

AuthorName: Robert Edmonds <stu@brainfood.com>


bofh_archive.site:

# Bastard Operator from Hell official archive
URL: http://bofh.ntk.net/Bastard.html
AuthorName: Marko Bozikovic <marko.bozikovic /at/ envox.hr>
Name: Bastard Operator from Hell
Levels: 2

StoryURL: .*\.html
StoryCachable: 1


dave_barry.site:

URL: http://www.miami.com/mld/miamiherald/living/columnists/dave_barry/
Name: Dave Barry
Description: Dave Barry's column for the Miami Herald
AuthorName: (update) Alan Hoyle <alan /at/ alanhoyle.com>
Levels: 2

ContentsStart: <td class="smalltitle" nowrap="nowrap">LATEST COLUMN
ContentsEnd: rightrail

#StoryURL: .*/dave_barry/.*\.htm   
#StoryURL: .*/gift_guide/.*\.htm   
StoryURL: .*\.htm
StoryStart: begin body-content
StoryEnd: end body-content
StoryHeadline: <h1>(.*?)</h1>

ContentsHTMLPreProcess: {
    s/(<td><hr size=\"1\" color=\"\#cccccc\" width=\"98\%\"><\/td>)//gm;
    s/(ADVERTISEMENT)//gm;
    s/^.*(Get in touch).*$//gm;
    s/^.*(davebarry).*$//gm;
    s/^.*(weird_news).*$//gm;
    s/^.*(vertdotline).*$//gm;


}
StoryHTMLPreProcess: {
    s/^.*(byline).*$//gm;
    s/^.*(Read more).*$//gm;
}


jon_carroll.site:

# site_samples/humor/jon_carroll.site
#
# San Francisco Chronicle : Columnists : Jon Carroll

URL:		http://www.sfgate.com/columnists/carroll/
Name:		Jon Carroll
Levels:		2

AuthorName:     Jan Lund Thomsen
AuthorEmail:    kwed@kwed.org

ContentsStart: <!-- \*\*\*\*\* BEGIN COLUMN RESULTS HERE \*\*\*\*\* -->
ContentsEnd: <!-- \*\*\*\*\* END COLUMN RESULTS HERE \*\*\*\*\* -->

StoryURL: http://www.sfgate.com/cgi-bin/article.cgi.*
StoryToPrintableSub: s/(.+)/$1\&type=printable/
StoryStart: <hr size="1" align="left">

#StoryStart: <!-- end #additionalcontent -->
#StoryEnd: <!-- END STORY -->


pigdog.site:

# Pigdog Journal
URL:            http://www.pigdog.org/pigdog.rdf
Name:           Pigdog Journal
Description:    The Online Handbook of Bad People of the Future
ContentsFormat: rss

StoryURL:       /.*.s?html?
StoryStart:	Feedback<br>
StoryEnd:	<td background="images/rightborder.gif">
ContentsStart:	<item>

AuthorName: Robert Edmonds <stu@brainfood.com>


satirewire.site:

URL: http://www.satirewire.com/
Name: SatireWire
Description: New Satire for the New Economy

Levels: 2
ContentsStart: <table border="0" cellpadding="4" cellspacing="2" align="right" width="125">
ContentsEnd: <a href=".top">Back to Top</a>
ContentsUseTableSmarts: 0

StoryURL: http: