Quantcast
Channel: MobileRead Forums - Calibre
Viewing all articles
Browse latest Browse all 31494

python regex: delete text in preprocessing

$
0
0
Hi all,

I am using a calibre recipe (Weltonline; german daily newspaper) to fetch news daily. Everything works fine, but in the final epub file after every news article there is plenty of (web-) rubbish I want to get rid of. Therefore I use Sigil and work on the epub I have downloaded from my calibre server. I delete everything between two string groups:

Code:

(    <div class="calibre7">
                  © Axel Springer AG 2013. Alle Rechte vorbehalten)([\s\S ]*?)(Weitere Hinweise</a></li>)

Now I have seen that there is a way to preprocess regex expressions while fetching the news with calibre. But I have also seen that python requires a slightly different approach towards regex and I am no expert in different regex dialects nor am I a python programmer.Took me quite a while to figure out the regex above :).

Could someone tell me how to use the above regex in the forementioned recipe in a way to use this preprocessing?

Thanks,
Sebastian

Viewing all articles
Browse latest Browse all 31494

Trending Articles