feeds2disk(1)
NAME
feeds2disk - part of calibre
SYNOPSIS
feeds2disk [options] ARG
DESCRIPTION
feeds2disk parses an online source of articles, like an RSS or ATOM
feed and fetches the article contents organized in a nice hierarchy.
ARG can be one of:
file name - feeds2disk will try to load a recipe from the
file
builtin recipe title - feeds2disk will load the builtin recipe and use
it to fetch the feed. For e.g. Newsweek or "The BBC" or "The New York
Times"
recipe as a string - feeds2disk will load the recipe directly from
the string arg.
Available builtin recipes are: u'168a', '24 Sata - Hr', '24 Sata - Sr',
u'ATV', u'Adventure Gamers', 'The Age', 'Al Jazeera in English',
'Ambito.com', 'The American Spectator', 'Arizona Daily Star', u'Ars
Technica', u'Associated Press', 'The Atlantic', u'The Australian',
'B92', u'The BBC', "Barron's", 'Blic', 'Borba Online', 'Business Week',
'CNN', u'Carta', 'Chicago Breaking News', 'Chicago Tribune', 'Christian
Science Monitor', u'Cincinnati Enquirer', 'Clarin', 'Climate Progress',
'Coding Horror', u'Common Dreams', 'Corriere della Sera', 'Corriere
della Sera in English', 'Courrier International', 'Critica de la
Argentina', u'Cyberpresse', 'DNA India', u'The Daily Mail', u'Daily
Telegraph', u'Danas', u'De Morgen', u'De Standaard', 'Diario Granma',
'Die Zeit Nachrichten', u'Discover Magazine', 'Dnevni Avaz', 'Dnevnik Hr', 'E-Novine', u'EL PAIS', 'ESPN', 'EcoGeek', 'The Economist', 'El
Cronista', 'El Mercurio online', 'El Mundo', 'El Universal',
'ElArgentino.com', u'Engadget', 'Exiled Online', 'FAZ NET', 'FTD',
u'Financial Times', u'Forbes', 'Freakonomics Blog', u'Fudzilla', 'Glas
Srpske', u'Glasgow Herald', 'Globe and Mail', 'Google Reader', u'The
Guardian', 'HNA', 'HRT: Vesti', u"Harper's Magazine", u"Harper's Magazine - articles from printed edition", u'The Hindu', 'Honolulu Advertiser', 'Honolulu Star-Bulletin', u'Hrszerz0151', u'INDEX.HU', u'Indianapolis Star', 'Infobae.com', 'Instapaper.com', u'The Irish Times',
u'The Japan Times', 'Jerusalem Post', 'Joel on Software', 'Jornal
Brasileiro Online', u'Jutarnji', 'Juventud Rebelde', 'Juventud Rebelde
in english', 'Krstarica - Vesti', 'Krstarica - news in english', 'La
Cuarta', 'La Mujer de mi Vida', 'La Nacion', 'La Nacion Chile', 'La
Prensa', 'La Prensa - Nicaragua', 'La Segunda', 'La Tercera',
'LeMonde.fr', u'Liberation', u'Linux Magazine', u'Linuxdevices', u'London Review of Books', u'The Los Angeles Times', 'MSDN Magazine', 'The
Market Ticker', 'Mediapart', 'The Miami Herald', 'Monde durable', 'Moneynews.com', u'The Moscow Times', 'NASA', 'NIN online', 'NSPM in
English', 'NZZ Online', 'Nacional - Hr', u'The Nation', 'New Scientist
- Online News', u'New York Review of Books', u'New York Review of Books
(no subscription)', 'The New York Times (subscription)', 'The New
Yorker', 'Newstimes', 'Newsweek', 'Nova srpska politicka misao', 'O
Estado de S. Paulo', 'O Globo', 'The Onion', 'Outlook India',
u'PCWORLD.HU', u'PHD Comics', 'Pagina/12', 'Pescanik', u'Physicstoday',
u'Physicsworld', 'Pobjeda Online', 'Politico', u'Politika Online',
'Portfolio', 'Press Online', 'RTS: Vesti', u'The Register', 'Reuters',
u'San Francisco Chronicle', u'Science AAAS', u'ScienceDaily', u'ScienceNews', u'Scientific American', u'The Scotsman', "Scott Hanselman's
Computer Zen", 'The Seattle Times', u'Shacknews', 'Soldiers', 'Spiegel
Online - German', u'Spiegel Online International', 'The St. Petersburg
Times', 'Stack Overflow - Blog', 'The Straits Times', u'Supersite for
Windows', 'Sydney Morning Herald', u'Sddeutsche', 'Tanjug', u'Telegraph.co.uk', u'Telepolis (Artikel)', u'Telepolis (News)', 'Teleread
Blog', u'Time', 'The Times Online', "Tom's Hardware US", 'Twitch
Films', 'US & World Report news', 'USA Today', 'United Press International', u'Utne reader', u'Vecernje Novosti', 'Vecernji List',
'Vijesti', 'Vreme', u'WOZ Die Wochenzeitung', 'The Wall Street Journal', 'Washington Post', 'Wikinews', 'Wired.com', u'derStandard',
u'diePresse', u'elektrolese', 'heise', u'la Repubblica', u'securitywatch', 'tomshardware', 'xkcd', 'zdnet', u'8054540865e962a57f51
zaobao.com'
Whenever you pass arguments to feeds2disk that have spaces in them,
enclose the arguments in quotation marks.
OPTIONS
- --version
- show program's version number and exit
- -h, --help
- show this help message and exit
- --feeds
- Specify a list of feeds to download. For example:
"['http://feeds.newsweek.com/newsweek/TopNews',
'http://feeds.newsweek.com/headlines/politics']" If you specify this option, any argument to feeds2disk is ignored and a default recipe is used to download the feeds. - --verbose
- Be more verbose while processing.
- --title
- The title for this recipe. Used as the title for any ebooks created from the downloaded feeds.
- --username
- Username for sites that require a login to access content.
- --password
- Password for sites that require a login to access content.
- --lrf Optimize fetching for subsequent conversion to LRF.
- --recursions
- Number of levels of links to follow on webpages that are linked to from feeds. Defaul 0
- --output-dir
- The directory in which to store the downloaded feeds. Defaults to the current directory.
- --no-progress-bar
- Dont show the progress bar
- --debug
- Very verbose output, useful for debugging.
- --test Useful for recipe development. Forces max_articles_per_feed to 2
- and downloads at most 2 feeds.
- WEB2DISK OPTIONS
- Options to control web2disk (used to fetch websites linked from feeds)
- -t, --timeout
Timeout in seconds to wait for a response from the server. Default: 10.0 s
- --delay
Minimum interval in seconds between consecutive fetches. Default is 0 s
- --encoding
The character encoding for the websites you are trying to download. The default is to try and guess the encoding.
- --match-regexp
Only links that match this regular expression will be followed. This option can be specified multiple times, in which case as long as a link matches any one regexp, it will be followed. By default all links are followed.
- --filter-regexp
Any link that matches this regular expression will be ignored. This option can be specified multiple times, in which case as long as any regexp matches a link, it will be ignored.By default, no links are ignored. If both --filter-regexp and --match-regexp are specified, then --filter-regexp is applied first.
- --dont-download-stylesheets
Do not download CSS stylesheets.
SEE ALSO
The User Manual is available at http://calibre.kovidgoyal.net/user_manual
- Created by Kovid Goyal <kovid@kovidgoyal.net>