This is a list with used regular expressions and 
keywords for removing and cleaning headlines.

Removing headlines containing dates
------------------------------------
Regular Exp.: 
\d{2}(\-|\.)\d{2}(\-|.)\d{4}: matches dd.mm.yyyy and dd-mm-yyyy
\d{2}\.\d{2}: matches dd.mm


Filtering headline grabber
--------------------------
Regular Exp.:
^\*+\s\w+\s\w+\s\*+: matches e.g. *** Hello World ***
\++(.*)\++\s-: matches e.g. ++ Hello World ++ 
^(News|BILD)\s\-\s: matches both News - or BILD - 


Removing headlines beginning with these keywords
------------------------------------------------ 
Liveblog,
Live,
LIVE,
NFL-TICKER,
Spieltag,
Exklusive,
Kommentare,
Meinung(en),
Interview,
Video,
TV,
Pop,
Show 
