Bug 14549 - cannot parse some opml file with bad characters
: cannot parse some opml file with bad characters
Status: RESOLVED WONTFIX
Product: Logitech Media Server
Classification: Unclassified
Component: Podcasts
: 7.4.1
: PC Debian Linux
: -- normal (vote)
: 7.4.x
Assigned To: Andy Grundman
: charset_issues
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2009-10-03 06:52 UTC by Nicolas
Modified: 2010-02-17 09:43 UTC (History)
2 users (show)

See Also:
Category: ---


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Nicolas 2009-10-03 06:52:20 UTC
I'm using this opml list to get french radio podcast
http://www.netsecureweb.com/radiofr.opml

podcast url are generated automatically and some channels sent bad characters.
For example there are many "" (it seems unicode 0019 character: "^Y" in ascii traces) instead of single quote "'"

<outline text="" title="On va sgêner" xmlUrl="http://www.europe1.fr/rss_export/feed/on-va-s-gener-laurent-ruquier" type="rss"/>

I know the issue is mostly on this file but to prevent error it should be good to put a workaround in xmlToHash, which does already contain some encoding workaround

I do not understand why utf8decode_guess does not correct encoding issue

here are my traces before and after utf8decode_guess call

09-10-03 15:21:23.6144] Slim::Formats::XML::xmlToHash (630) before:    <outline text="" title="On va s^YgM-CM-*ner" xmlUrl="http://www.europe1.fr/rss_export/feed/on-va-s-gener-laurent-ruquier" type="rss"/>
[09-10-03 15:21:23.6147] Slim::Formats::XML::xmlToHash (632) after:     <outline text="" title="On va s^YgM-CM-*ner" xmlUrl="http://www.europe1.fr/rss_export/feed/on-va-s-gener-laurent-ruquier" type="rss"/>
Comment 1 James Richardson 2009-10-07 10:05:14 UTC
Andy/Michael: your thoughts on this?
Comment 2 Nicolas 2009-10-07 10:27:28 UTC
I've tested on 7.3.X because squeezescrobbler plugin does not work well on 7.4.X, and the issue is the same on this version.

It's clear that issue is in opml file but to prevent browsing failed in opml file, it may be converted to right charset or just ignore related item
Comment 3 Andy Grundman 2010-02-17 09:43:21 UTC
OPML must be valid XML.