1
0
Fork 0
mirror of https://gitlab.com/news-flash/article_scraper.git synced 2025-07-08 08:30:00 +02:00
article_scraper/resources/tests/readability
2023-04-02 13:22:16 +02:00
..
001 whitespace fixes 2023-03-24 08:02:08 +01:00
002 whitespace fixes 2023-03-24 08:02:08 +01:00
003 whitespace fixes 2023-03-24 08:02:08 +01:00
aclu whitespace fixes 2023-03-24 08:02:08 +01:00
aktualne whitespace fixes 2023-03-24 08:02:08 +01:00
archive-of-our-own whitespace fixes 2023-03-24 08:02:08 +01:00
ars-1 whitespace fixes 2023-03-24 08:02:08 +01:00
base-url-base-element-relative fix url completion for hash urls 2023-03-30 21:27:35 +02:00
basic-tags-cleaning whitespace fixes 2023-03-24 08:02:08 +01:00
bbc-1 whitespace fixes 2023-03-24 08:02:08 +01:00
blogger whitespace fixes 2023-03-24 08:02:08 +01:00
breitbart whitespace fixes 2023-03-24 08:02:08 +01:00
bug-1255978 whitespace fixes 2023-03-24 08:02:08 +01:00
buzzfeed-1 whitespace fixes 2023-03-24 08:02:08 +01:00
citylab-1 update lazy image fixing code 2023-03-27 21:10:48 +02:00
clean-links clean js-links & add new test 2023-03-26 11:31:59 +02:00
cnet whitespace fixes 2023-03-24 08:02:08 +01:00
cnet-svg-classes 6 more tags & make seattletimes test consistent 2023-04-01 18:14:05 +02:00
cnn whitespace fixes 2023-03-24 08:02:08 +01:00
comment-inside-script-parsing whitespace fixes 2023-03-24 08:02:08 +01:00
daringfireball-1 whitespace fixes 2023-03-24 08:02:08 +01:00
data-url-image update lazy image fixing code 2023-03-27 21:10:48 +02:00
dev418 whitespace fixes 2023-03-24 08:02:08 +01:00
dropbox-blog whitespace fixes 2023-03-24 08:02:08 +01:00
ebb-org whitespace fixes 2023-03-24 08:02:08 +01:00
ehow-1 more tests & title fixes 2023-03-29 08:35:36 +02:00
ehow-2 whitespace fixes 2023-03-24 08:02:08 +01:00
embedded-videos more tests & title fixes 2023-03-29 08:35:36 +02:00
engadget fix url completion for hash urls 2023-03-30 21:27:35 +02:00
firefox-nightly-blog update lazy image fixing code 2023-03-27 21:10:48 +02:00
folha whitespace fixes 2023-03-24 08:02:08 +01:00
gmw whitespace fixes 2023-03-24 08:02:08 +01:00
google-sre-book-1 fix url completion for hash urls 2023-03-30 21:27:35 +02:00
guardian-1 fix url completion for hash urls 2023-03-30 21:27:35 +02:00
heise whitespace fixes 2023-03-24 08:02:08 +01:00
herald-sun-1 whitespace fixes 2023-03-24 08:02:08 +01:00
hidden-nodes whitespace fixes 2023-03-24 08:02:08 +01:00
hukumusume clean js-links & add new test 2023-03-26 11:31:59 +02:00
iab-1 whitespace fixes 2023-03-24 08:02:08 +01:00
ietf-1 fix url completion for hash urls 2023-03-30 21:27:35 +02:00
js-link-replacement clean js-links & add new test 2023-03-26 11:31:59 +02:00
keep-images more tests & title fixes 2023-03-29 08:35:36 +02:00
keep-tabular-data fix strip unlikely table-child & add 2 new tests 2023-03-26 11:54:13 +02:00
la-nacion update lazy image fixing code 2023-03-27 21:10:48 +02:00
lazy-image-1 update lazy image fixing code 2023-03-27 21:10:48 +02:00
lazy-image-2 update lazy image fixing code 2023-03-27 21:10:48 +02:00
lazy-image-3 update lazy image fixing code 2023-03-27 21:10:48 +02:00
lemonde-1 4 more tests 2023-03-28 07:25:05 +02:00
liberation-1 4 more tests 2023-03-28 07:25:05 +02:00
lifehacker-post-comment-load 4 more tests 2023-03-28 07:25:05 +02:00
lifehacker-working 4 more tests 2023-03-28 07:25:05 +02:00
links-in-tables more tests & title fixes 2023-03-29 08:35:36 +02:00
lwn-1 more tests & title fixes 2023-03-29 08:35:36 +02:00
medicalnewstoday fix medialnewstoday test 2023-03-30 07:58:11 +02:00
medium-1 more tests & title fixes 2023-03-29 08:35:36 +02:00
medium-2 more tests & title fixes 2023-03-29 08:35:36 +02:00
medium-3 more tests & title fixes 2023-03-29 08:35:36 +02:00
mercurial fix url completion for hash urls 2023-03-30 21:27:35 +02:00
metadata-content-missing 2 passing test & 2 failing tests 2023-03-29 18:08:00 +02:00
missing-paragraphs 2 passing test & 2 failing tests 2023-03-29 18:08:00 +02:00
mozilla-1 mozilla test consitency 2023-03-30 21:35:31 +02:00
mozilla-2 3 more tests 2023-03-31 07:09:13 +02:00
msn 3 more tests 2023-03-31 07:09:13 +02:00
normalize-spaces 3 more tests 2023-03-31 07:09:13 +02:00
nytimes-1 start adding nytimes tests 2023-03-31 09:37:23 +02:00
nytimes-2 start adding nytimes tests 2023-03-31 09:37:23 +02:00
nytimes-3 fix relative srcset urls & more tests 2023-04-02 09:03:37 +02:00
nytimes-4 fix relative srcset urls & more tests 2023-04-02 09:03:37 +02:00
nytimes-5 fix nytimes-3 2023-03-31 10:38:04 +02:00
pixnet adding more tests 2023-03-31 11:23:44 +02:00
qq qq -.- 2023-03-31 21:21:14 +02:00
quanta-1 adding more tests 2023-03-31 11:23:44 +02:00
remove-aria-hidden adding more tests 2023-03-31 11:23:44 +02:00
remove-extra-paragraphs adding more tests 2023-03-31 11:23:44 +02:00
remove-script-tags adding more tests 2023-03-31 11:23:44 +02:00
reordering-paragraphs adding more tests 2023-03-31 11:23:44 +02:00
replace-font-tags fix replacing font tags 2023-04-01 12:31:56 +02:00
salon-1 4 more test & remove share elements 2023-04-01 17:19:37 +02:00
seattletimes-1 6 more tags & make seattletimes test consistent 2023-04-01 18:14:05 +02:00
simplyfound-1 4 more test & remove share elements 2023-04-01 17:19:37 +02:00
social-buttons 4 more test & remove share elements 2023-04-01 17:19:37 +02:00
style-tags-removal 6 more tags & make seattletimes test consistent 2023-04-01 18:14:05 +02:00
svg-parsing 6 more tags & make seattletimes test consistent 2023-04-01 18:14:05 +02:00
table-style-attributes 6 more tags & make seattletimes test consistent 2023-04-01 18:14:05 +02:00
telegraph 6 more tags & make seattletimes test consistent 2023-04-01 18:14:05 +02:00
title-and-h1-discrepancy 6 more tags & make seattletimes test consistent 2023-04-01 18:14:05 +02:00
tmz-1 6 more tags & make seattletimes test consistent 2023-04-01 18:14:05 +02:00
toc-missing 6 more tests 2023-04-01 18:22:42 +02:00
topicseed-1 6 more tests 2023-04-01 18:22:42 +02:00
tumblr 6 more tests 2023-04-01 18:22:42 +02:00
v8-blog 6 more tests 2023-04-01 18:22:42 +02:00
videos-1 6 more tests 2023-04-01 18:22:42 +02:00
videos-2 6 more tests 2023-04-01 18:22:42 +02:00
wapo-1 fix relative srcset urls & more tests 2023-04-02 09:03:37 +02:00
wapo-2 fix relative srcset urls & more tests 2023-04-02 09:03:37 +02:00
webmd-1 whitespace fixes 2023-03-24 08:02:08 +01:00
webmd-2 fix relative srcset urls & more tests 2023-04-02 09:03:37 +02:00
wikia fix relative srcset urls & more tests 2023-04-02 09:03:37 +02:00
wikipedia fix relative srcset urls & more tests 2023-04-02 09:03:37 +02:00
wikipedia-2 port final tests from readability for now 2023-04-02 13:22:16 +02:00
wikipedia-3 port final tests from readability for now 2023-04-02 13:22:16 +02:00
wordpress fix hidden fallback images for wikipedia & add more tests 2023-04-02 09:55:25 +02:00
yahoo-1 port final tests from readability for now 2023-04-02 13:22:16 +02:00
yahoo-2 port final tests from readability for now 2023-04-02 13:22:16 +02:00
yahoo-3 port final tests from readability for now 2023-04-02 13:22:16 +02:00
yahoo-4 port final tests from readability for now 2023-04-02 13:22:16 +02:00
youth port final tests from readability for now 2023-04-02 13:22:16 +02:00