Skip to content

Commit 7610638

Browse files
committed
Swedish sites
1 parent 229d6e4 commit 7610638

File tree

5 files changed

+44
-13
lines changed

5 files changed

+44
-13
lines changed

.expressen.se.txt

Lines changed: 0 additions & 6 deletions
This file was deleted.

aftonbladet.se.txt

Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,13 @@
1+
author: //article//address[contains(@class, 'author')]
2+
body: //article[.//div[contains(@class, 'abBodyText')]]//*[contains(@class, 'abLeadText') or contains(@class, 'abBodyText') or contains(@class, 'abImageBlock') or contains(@class, 'abIGSatellite')]
3+
4+
strip: //address//img
5+
strip: //footer
6+
strip_id_or_class: abSticky
7+
8+
prune: no
9+
10+
test_url: http://www.aftonbladet.se/sportbladet/hockey/sverige/allsvenskan/article17498194.ab
11+
test_url: http://www.aftonbladet.se/debatt/article16207536.ab
12+
test_url: http://www.aftonbladet.se/debatt/debattamnen/politik/article17483377.ab
13+
test_url: http://www.aftonbladet.se/rss.xml

expressen.se.txt

Lines changed: 8 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,10 @@
1-
title: //div[@id='article']/div[contains(@class, 'content')]/h1
2-
body: //div[@id='article']/div[contains(@class, 'content')]
3-
date: //div[contains(@class, 'article-slot')]/descendant::div[contains(@id, 'articledates')]
1+
title: //h1[contains(@class, 'b-headline_article')]
2+
body: //div[contains(@class, 'b-article_print')]
3+
4+
single_page_link: //div[contains(@class, 'b-page__footer__actions')]//a[contains(@href, 'print=true')]
45

5-
strip: //img[contains(@src, 'img/px.gif')]
66
prune: no
7-
# remove Facebook banner and obtrusive ad
8-
strip: //div[@id='article']/div[contains(@class, 'content')]/div[contains(@class, 'art-right')]
9-
test_url: http://www.expressen.se/kultur/1.2683904/medan-natet-dras-at
7+
8+
test_url: http://www.expressen.se/kultur/1.2683904/medan-natet-dras-at
9+
test_url: http://www.expressen.se/gt/polis-om-styckmordet-extremt-markligt-fall/
10+
test_url: http://www.expressen.se/Pages/OutboundFeedsPage.aspx?id=3642159&viewstyle=rss

resume.se.txt

Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,9 @@
1+
date: //meta[@name='bi3dPubDate']/@content
2+
body: //div[contains(@class, 'articleBody')]
3+
4+
prune: no
5+
6+
test_url: http://www.resume.se/nyheter/media/2013/09/18/kvallspress-och-tv-slass-om-playtittarna-men-youtube-ohotat-storst/
7+
test_url: http://www.resume.se/nyheter/media/2013/09/18/cecilia-blankens-lamnar-mama-for-konkurrent/
8+
test_url: http://www.resume.se/nyheter/reklam/2013/09/18/ravelli-trodde-jag-var-med-i-blasningen/
9+
test_url: http://www.resume.se/rss-nyheter

svt.se.txt

Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
title: //article[@role='main']//h1
2+
body: //article[@role='main']
3+
strip: //aside
4+
replace_string(<noscript>): <div>
5+
replace_string(</noscript>): </div>
6+
strip_id_or_class: svtHide-No-Js
7+
strip_id_or_class: aside
8+
strip_id_or_class: hidden
9+
tidy: no
10+
prune: no
11+
12+
test_url: http://www.svt.se/ug/framtidsdrommar-om-jobb-blev-lackande-gifthal
13+
test_url: http://www.svt.se/nyheter/het-debatt-mellan-borg-och-andersson
14+
test_url: http://www.svt.se/nyheter/regionalt/svtsormland/sj-tag-evakuerades-efter-rokdrama

0 commit comments

Comments
 (0)