Skip to content
This repository has been archived by the owner on Sep 28, 2023. It is now read-only.

"Full Content" does not fetch the entire article #751

Open
niveK77pur opened this issue May 30, 2023 · 0 comments
Open

"Full Content" does not fetch the entire article #751

niveK77pur opened this issue May 30, 2023 · 0 comments

Comments

@niveK77pur
Copy link

Describe the bug
Feeds from the https://arstechnica.com/ site only have the first few paragraphs or sections scraped when using Full Content. The rest of the article very consistently does not appear. It requires using View original to see everything until the end.

To Reproduce
Steps to reproduce the behavior:

  1. Add feed: http://feeds.arstechnica.com/arstechnica/index
  2. Fetch articles
  3. Compare the contents from Full Content and View original
  4. See how Full Content's text stops midway
  5. If 4. could not be observed, find another (longer) article and repeat from 3.

Expected behavior
The article should be scraped in its entirety.

Screenshots
The end of an article as seen using Full Content
VinLudensScreenshot

The same passage in the article as seen using View original (also see the scroll bar, the article goes on for much longer)
VinLudensScreenshot

Desktop:

  • OS: ArcoLinux (Arch)
  • Browser: Raven built-in (?)
  • Version: 1.0.79

Additional context

It appears as though content stops being shown after the 2nd or 3rd advertisement.

To be noted is that their articles tend to be quite long. So far, ArsTechnica is the most prominent one where I observe this behavior, so I am not too sure if it is an isolated case.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant