X-Git-Url: http://git.ozo.com/?a=blobdiff_plain;f=todolist;h=6d5acd6ec4896a89e41e08962190f4c16a2a0261;hb=cdd5dc01ecb50b3f8d9119c97ff2f3c84a8b3571;hp=70fbdfd1f5262f8babedf4b72a49a937ad7558e8;hpb=9cd1026fe72d7bb57acf332f392a7ddf4e1b62c2;p=rawdog%2F.git diff --git a/todolist b/todolist index 70fbdfd..6d5acd6 100644 --- a/todolist +++ b/todolist @@ -1,3 +1,11 @@ +FIXME: redo sorting hack in feedparser. + +If rawdog crashes while updating a feed, it shouldn't forget the feeds it's +already updated. Perhaps have an exception handler that keeps a safety copy of +the state file and saves where it's got to so far? + +Improve efficiency -- memoise stuff before comparing articles. + Detect duplicate articles (BBC News). Detect edited articles by guid. @@ -8,8 +16,6 @@ Option to choose whether full content or summary is preferred. Review expiry logic: is maxage=0 the same as currentonly? -Escape characters in text/plain? - Plugin that gets run on new articles. OPML listing -- needs feed type. @@ -31,16 +37,11 @@ specifying a feed. Add a needs_update() method to Feed; make Rawdog call that on all the feeds (when not being forced) and then call update() on each of them that needs it. -Make feedparser take a urllib2.Opener rather than proxies etc. - Where articles can be shown to have been updated (by unique ID), only show the latest version. For next release: - generalise templating (so the show template option takes a name, etc.) -- remove Article compat methods -- remove old option parsing -- add date to Article hash - require Python 2.3; kill 2.2 urllib2 workaround - use unicode.encode('ascii','xmlcharrefreplace') if possible?