
script.module.openscrapers's Introduction

#               ██████╗ ██████╗ ███████╗███╗   ██╗                 
#              ██╔═══██╗██╔══██╗██╔════╝████╗  ██║                 
#              ██║   ██║██████╔╝█████╗  ██╔██╗ ██║                 
#              ██║   ██║██╔═══╝ ██╔══╝  ██║╚██╗██║                 
#              ╚██████╔╝██║     ███████╗██║ ╚████║                 
#               ╚═════╝ ╚═╝     ╚══════╝╚═╝  ╚═══╝                 
#                                                                  
#  ███████╗ ██████╗██████╗  █████╗ ██████╗ ███████╗██████╗ ███████╗
#  ██╔════╝██╔════╝██╔══██╗██╔══██╗██╔══██╗██╔════╝██╔══██╗██╔════╝
#  ███████╗██║     ██████╔╝███████║██████╔╝█████╗  ██████╔╝███████╗
#  ╚════██║██║     ██╔══██╗██╔══██║██╔═══╝ ██╔══╝  ██╔══██╗╚════██║
#  ███████║╚██████╗██║  ██║██║  ██║██║     ███████╗██║  ██║███████║
#  ╚══════╝ ╚═════╝╚═╝  ╚═╝╚═╝  ╚═╝╚═╝     ╚══════╝╚═╝  ╚═╝╚══════╝
#                                                                  

Welcome to the OpenScrapers project.

The hope of this project is to unify the community behind a single scraper pack for multi-scraper add-ons, so the repo doesn't go dead or disappear. The goal is to put aside the drama and the egos, work together, and make this a great scraper pack that benefits the whole community. Addons4Kodi takes no credit for putting this together, and we thank all the devs who have contributed to the various projects over time.

OpenScrapers Repo

You can add the source directory to your own repository for convenience and updates:

<dir>
    <info compressed="false">https://raw.githubusercontent.com/a4k-openproject/repository.openscrapers/master/zips/addons.xml</info>
    <checksum>https://raw.githubusercontent.com/a4k-openproject/repository.openscrapers/master/zips/addons.xml.md5</checksum>
    <datadir zip="true">https://raw.githubusercontent.com/a4k-openproject/repository.openscrapers/master/zips/</datadir>
</dir>

How to Import Open Scrapers Into Any Addon

Any multi-source Kodi addon can be altered to use these scrapers instead of its own; follow the instructions below to get things updated. When applying this to a different addon, replace "name_of_addon" with the name of the addon.

Open the addons/plugin.video.name_of_addon/addon.xml.

Add the following line to the addon.xml file:

<import addon="script.module.openscrapers"/>

Open addons/script.module.name_of_addon/lib/resources/lib/modules/sources.py

Add the following line to the sources.py file:

import openscrapers

Add it right after the line that says:

import re

You will also need to change a few lines in the getConstants(self) function in the sources.py file:

Find the line that says:

from resources.lib.sources import sources

Comment out that line by adding a pound/hashtag at the beginning like this:

#from resources.lib.sources import sources

Then add the following line:

from openscrapers import sources
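Putting the two changes together, the top of the edited getConstants() would look roughly like this (a sketch only; the rest of the function body stays whatever your addon already had):

```python
def getConstants(self):
    # from resources.lib.sources import sources   # original line, commented out
    from openscrapers import sources              # OpenScrapers replacement
    # ...rest of the original function unchanged...
```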

External Scraper Tester

With the help of Jabaxtor, we now have an external scraper tester that can test any scraper folder in lib\openscrapers\sources_openscrapers. This also means you can bring in scraper folders from other addons and add them to this directory, though you will have to do a little work to get them running right; read below for more info.

In the root directory of OpenScrapers you will find two files:

scrape-test.py and Scraper Tester.bat

scrape-test.py is where all the magic happens.

Requirements: the latest version of Python 2, with the bs4 dependency installed:

pip install bs4

Command Arguments

folders=(name of scraper folder, e.g. en,en_DebridOnly)

test_type=(1 or 0)

test_mode=(movie or episode)

timeout_mode=(true, y, True, false, False, n)

number_of_tests=(1-500)

Argument Explanations

folders: Specifies the folder or folders you want to test, test multiple with a comma separator

test_type: Specifies if you'd like to test all scrapers in the folder or just a specific one

test_mode: Specifies the type of test you'd like to run such as testing scrapers against a set of movies or episodes

timeout_mode: Specifies whether to use a 60-second timeout. If set to true, True, or y, it will force number_of_tests to 1

number_of_tests: Specifies the number of titles you'd like to test against the scrapers, such as 10 movies from Trakt's popular list

Example Scraper Command

Requires python to run

scrape-test.py folders=en,en_DebridOnly test_type=1 test_mode=movie timeout_mode=false number_of_tests=10

This will test all scrapers in en and en_DebridOnly against 10 movies from Trakt's popular movie list, continuing until the scrape finishes.

Adding Scrapers from other addons

First, copy the scraper folder (usually called something like "en") from an addon such as EggScrapers, and rename it to something that isn't already in lib\openscrapers\sources_openscrapers; for EggScrapers, for instance, call the folder scrapertest-egg.

Then copy the __init__.py file from any other folder, such as en, and add it to the new one.

Then open all the scrapers in something like Notepad++ and replace

from resources.lib.modules

with

from openscrapers.modules

in all open files.

This is needed because the scrapers must use the modules from OpenScrapers instead of the external addon's.
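The same find-and-replace can be scripted. This is a convenience sketch (the function name is mine, not part of the project; Notepad++ does the same job):

```python
import os

def retarget_imports(folder):
    """Point every 'from resources.lib.modules' import in the .py
    scrapers inside folder at openscrapers.modules instead."""
    for name in os.listdir(folder):
        if not name.endswith('.py'):
            continue
        path = os.path.join(folder, name)
        with open(path) as f:
            src = f.read()
        patched = src.replace('from resources.lib.modules',
                              'from openscrapers.modules')
        # Only rewrite files that actually contained the old import.
        if patched != src:
            with open(path, 'w') as f:
                f.write(patched)
```

Run it once against the new scraper folder, e.g. retarget_imports('scrapertest-egg').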

Now you're ready to run your command; set the folder argument to folders=scrapertest-egg.

Scraper Tester Batch

I made an easy-to-use batch file pre-configured for OpenScrapers, EggScrapers, Yoda, and Scrubs.

Once you open it, you will get options to test the different addons; pretty easy to follow along :)

Preset folder names in the batch file for external addons are listed below, so please follow the last section and use these folder names when testing external scrapers from the preset addons:

scrapertest-egg, scrapertest-yoda, scrapertest-scrubs

One thing you should know: if a scraper hangs, it will stall the whole test. You can check by opening the test-results folder and reading the txt results file for the folder you're testing. If you see the same set of scrapers repeating over and over, there's an issue with those scrapers. Close the window or press Ctrl+C to terminate the batch so you can move those scrapers out and try again!

Enjoy!

script.module.openscrapers's People

Contributors

123venom, a4k-official, doko-desuka, drinfernoo, gateofgator, host505, i-a-c, jabaxtor, kodiultimate, nazegnl, reddit-reaper, rickdoesxmas, sraedler, thedevfreak, tikipeter


script.module.openscrapers's Issues

Unresolved reference - Anilist

The following scrapers all reference the missing module anilist:

Animeloads
Proxer
Foxx
Pureanime

This module will either need to be found or the providers removed.

Using proxy sites

Being in the UK, I have to use a lot of proxy sites to get torrents to scrape. I used https://limetorrents.unblockit.me/ with OpenScrapers and others, and it used to work, but it stopped working some time ago. Is this because of the Cloudflare v2 change?

Torrent scraping broke after c06e458

c06e458
What is this debrid.tor_enabled() supposed to check? It breaks torrent scraping on my add-on, and on Exodus Redux as well.
Removing this check from torrent scrapers makes them work again.
Are we supposed to implement an extra setting or something?
On a side note, this commit adds some lines with tabs, unlike the rest of the file's spaces indentation.

settings... torrents

The settings for toggling torrents on or off still refer to script.module.civitasscrapers.

user_agents.py

v196 seems to be having trouble with user_agents.py (may just be me); I had to revert to the last cfscrape etc., as scrapers can't load user_agents.py.

Adjust files "default.py" and "addon.xml"

Please excuse my poor English.

Hello,

is there a special reason why in the 'addon.xml' the line
<extension point="xbmc.python.pluginsource" library="lib/default.py">
is not
<extension point="xbmc.python.script" library="lib/default.py">
like other script modules?

I changed this for me and also changed the 'default.py' a little bit. The advantage is that after changing the settings, the settings are always saved when you click on the "OK" button.

As an example my "default.py":

default.py.txt

New function "get_titles_for_search()" for "source_utils.py"

Please transfer the following function get_titles_for_search() to source_utils.py:

def get_titles_for_search(title, localtitle, aliases):
    try:
        titles = []
        if "country':" in str(aliases): aliases = aliases_to_array(aliases)
        if localtitle != '': titles.append(localtitle)
        if title != '' and title != localtitle: titles.append(title)
        [titles.append(i) for i in aliases if i.lower() != title.lower() and i.lower() != localtitle.lower() and i != '']
        titles = [str(i) for i in titles if all(ord(c) < 128 for c in i)]
        return titles
    except:
        return []

This function simplifies the writing of scrapers.
It builds a deduplicated list of titles from the values passed in.

As an example, here are some code lines from a scraper:

old:

def movie(self, imdb, title, localtitle, aliases, year):
    try:
        url = self.__search([localtitle] + source_utils.aliases_to_array(aliases))
        if not url and title != localtitle: url = self.__search([title] + source_utils.aliases_to_array(aliases))
        return url
    except:
        return

new with "get_titles_for_search()"

def movie(self, imdb, title, localtitle, aliases, year):
    try:
        return self.__search(source_utils.get_titles_for_search(title, localtitle, aliases))
    except:
        return
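A standalone, simplified copy of the proposed helper (with the aliases_to_array branch dropped, assuming aliases arrive as a plain list of strings) shows the deduplication in action:

```python
def get_titles_for_search(title, localtitle, aliases):
    # Simplified stand-in for the proposed source_utils helper:
    # collect localtitle, title, and any aliases, skipping empty strings
    # and case-insensitive duplicates, then keep only ASCII titles.
    try:
        titles = []
        if localtitle != '':
            titles.append(localtitle)
        if title != '' and title != localtitle:
            titles.append(title)
        for alias in aliases:
            if alias and alias.lower() not in (title.lower(), localtitle.lower()):
                titles.append(alias)
        return [str(t) for t in titles if all(ord(c) < 128 for c in t)]
    except Exception:
        return []

print(get_titles_for_search('The Matrix', 'Matrix', ['The Matrix', 'Matrix 4']))
# ['Matrix', 'The Matrix', 'Matrix 4']
```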

many thanks

anilist.py

should anilist.py have this changed:
from resources.lib.modules import
to this:
from openscrapers.modules import

dead PL providers

Please remove the following, as the websites are dead and no longer exist:

<setting id="provider.openkatalog" type="bool" label="OPENKATALOG" default="false" />
<setting id="provider.paczamy" type="bool" label="PACZAMY" default="false" />
<setting id="provider.trt" type="bool" label="TRT" default="false" />

show Easynews & Furk links as 'Premium'

Currently Easynews & Furk results are treated as 'direct' sources by addons, this is a problem if using the 'use debrid only' filter (and possibly other sorting/filtering options).

The solution would be to treat these links as 'premium', alongside torrent/debrid, rather than as 'free' links.

v197 en onlineseries

Sorry, I forgot to mention that there is a copy of onlineseries in en that still calls for dom_parser2; the one in en_DebridOnly is correct, and I guess the one in en should be removed.
And just a thank you for creating and maintaining OpenScrapers ;)

the way to add openscrapers to a addon

Hi, I am having real trouble getting this to work. I have followed each step four times now and I'm still getting the same issue. If you have a Telegram group, can you send me an invite, please, so maybe someone can help me get it working?

v.195, 2ddl and rapidmoviez

2ddl gives this error: http://2ddl.vg/ returned an error. Could not collect tokens.

And rapidmoviez requires "from openscrapers.modules import dom_parser2", but dom_parser2.py is not in modules. I was able to add my own dom_parser2, but I thought I would let you know.

Foreign GERMAN providers

Hi.
Can you check the German providers list, please?
I use Exodus Redux and it loads no links when I set OpenScrapers to use the German providers.
BTW, when I use LambdaScrapers everything works.
I have noticed that in LambdaScrapers the list of German providers is completely different.

OpenSSL error with cfscrape

Hello. Using openscrapers on Kodi 18.6 under windows 10.

When trying to scrape the French yggtorrent website, I get this error:

DEPRECATION: The OpenSSL being used by this python install (OpenSSL 1.0.2j 26 Sep 2016) does not meet the minimum supported version (>= OpenSSL 1.1.1) in order to support TLS 1.3 required by Cloudflare, You may encounter an unexpected reCaptcha or cloudflare 1020 blocks

And I can't bypass the Cloudflare protection.

Any ideas?

Series9

@nazegnl
Series9 giving me this error on latest dev branch

Traceback (most recent call last):
File "C:\Users*\Documents\GitHub\script.module.openscrapers\lib\openscrapers\sources_openscrapers\en\series9.py", line 111, in sources
url = self.searchMovie(data['title'], data['year'])
File "C:\Users*\Documents\GitHub\script.module.openscrapers\lib\openscrapers\sources_openscrapers\en\series9.py", line 93, in searchMovie
url = [i[0] for i in results if cleantitle.get(i[1]) == cleantitle.get(title)][0]
IndexError: list index out of range
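The crash comes from indexing [0] on a list comprehension that can be empty when no title matches. A generic defensive pattern (not the project's actual fix, just an illustration) is to take the first match or None:

```python
def first_or_none(matches):
    # Return the first element, or None when the search found nothing,
    # instead of letting an empty result raise IndexError via [0].
    return next(iter(matches), None)

results = [('url-a', 'Title A'), ('url-b', 'Title B')]
url = first_or_none(u for u, t in results if t == 'Title C')
# url is None rather than an IndexError crash
```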

Vidics is down

Hi,
Just wanted to pass along that vidics is down. Could someone take a look at it?

Thanks.

Control module does not access Openscrapers settings.

Hi there.
I've written an Easynews scraper for Openscrapers, but there is a problem with the control.py file that should be used to access the settings of openscrapers.

The 'addon' variable (accessing xbmcaddon.Addon) needs to explicitly state the openscrapers id as its 'id' arg...
addon = xbmcaddon.Addon(id='script.module.openscrapers')
at the moment it is like this...
addon = xbmcaddon.Addon()

This has implications for other variables in the code, such as "setting", which is assigned "addon.getSetting". If 'addon' is not set to openscrapers, this 'setting' call will read the settings of whichever addon is accessing the scraper. So, for example, if Venom calls the new Easynews scraper, the 'settings' calls in the Easynews scraper will check Venom's settings instead of OpenScrapers' settings.

I can fix this with a pull request, I just don't know whether it will affect the scrapers test code incorporated into Openscrapers.

myvideolink

myvideolink may be broken; is anyone else getting errors from it? Thanks.

[Suggestion]Adding Headers on requests

As the title says, please add headers to the cfscrape requests to hide Kodi's headers.
In the case of client.request(url), headers are created inside the request function of the client module, so you don't need to set a User-Agent; but on normal requests and cfscrape requests you need to set a User-Agent, and possibly the scraper's base URL as Referer, to hide that the requests come from Kodi.
For example:

scraper = cfscrape.create_scraper()
headers = {'User-Agent': client.agent(), 'Referer': self.base_url}
html = scraper.get(url, headers=headers).text

documentation for developing new scrapers

Hello, I have been developing scrapers using BeautifulSoup for a while. I am interested in developing scrapers that can be integrated with OpenScrapers. Is there documentation with clear instructions on how to develop scrapers to your pattern?

problems with many german scraper sites

Really, many German scrapers seem to be broken.
I used Venom with only the foreign scrapers enabled and set to German indexers within Venom.

Right now I have only found sources at iload, ddl(.me?) and streamto.
I know the searched series is at least on serienstream (s.to), freikino, hdfilme, and kinox.to.

Could someone look into this?

Or if anyone has an "easy" guide for making scrapers, I could try it myself.
(I haven't done this before, nor used Python much.)

Furk scraper does not return any values

I did a fresh install of the latest Exodus Redux. Disabled all other providers except Furk. Set up my login credentials and API key. Tried a search and got the notification that there was no stream found. My search on Furk.net itself returns plenty results. Did a test with the default providers and got back plenty of results. Looks like the Furk scraper is broken. Can you please look into this?

Digbt.py Crashes Dialog Box with Results

I am having an issue with digbt.py causing the dialog window to not show other links. I tested it with one known-working scraper and digbt.py alone in the folder, and it prevents the dialog box from coming up. If I comment out the self variables, it works correctly (with no links from digbt). I noticed it is CF-based, and those are always hard to fix. I just wanted to see if you can confirm as well.

Thanks

How do I add some scrapers from an addon

As the title mentions, I would like to add some scrapers to OpenScrapers. They are from an addon I have; I have tested them to make sure they work, checked for duplicates, and removed the duplicates as well.

v 0.0.0.7 cfscrape

In v 0.0.0.7 the updated cfscrape seems to have broken rlsbb: way fewer premium links, with rlsbb not working. I put the cfscrape from v 0.0.0.5 into v 0.0.0.7, and then rlsbb links are scraped and work.

openscrapers settings revert on "ok"

Here is a strange one. For example, if I open the scraper settings and choose to disable all torrent providers, it flashes and toggles them all off. Then I click "OK" and it exits, but when I go back in, the torrents are all toggled back on. However, if I disable all and then hit Cancel to close settings, then go back in, my changes have been saved. It's as if OK and Cancel are acting in reverse? I'm using Kodi 18.1.

CSV Export Separator

Hey @nazegnl, I just merged your PR and tested scrape test. All CSV outputs are using ; instead of ,, so I have to go into all the files and change ; to ,. Can you please look at it again?

vidics & xwatchseries

Just curious why vidics and xwatchseries are not being used, because I'm getting a lot of links with them for episodes?

re v199

Awesome job! Thanks to all involved; thumbs up to the new additions in the credits ;)

re v1.106 Some scrapers not working

v1.106
I don't use many free scrapers; I do use most of the debrid scrapers. Of the scrapers I use, I have had these issues (P.S. thank you for all your efforts, I'm just trying to contribute):
[2020-03-17 05:37:04] [COLOR red][ OPENSCRAPERS DEBUG ][/COLOR]: Error: Loading module: "projectfreetv": cannot import name cfScraper
[2020-03-17 05:37:06] [COLOR red][ OPENSCRAPERS DEBUG ][/COLOR]: Request-Error (500): http://www.sceneddl.me/?s=Riviera+S02E10
[2020-03-17 05:37:06] [COLOR red][ OPENSCRAPERS DEBUG ][/COLOR]: Request-Error: (unknown url type: Riviera) => Riviera
[2020-03-17 05:37:06] [COLOR red][ OPENSCRAPERS DEBUG ][/COLOR]: Request-Error: (unknown url type: Riviera) => Riviera
[2020-03-17 05:37:06] [COLOR red][ OPENSCRAPERS DEBUG ][/COLOR]: Request-Error: (unknown url type: Riviera) => Riviera
[2020-03-17 05:37:07] [COLOR red][ OPENSCRAPERS DEBUG ][/COLOR]: MYVIDEOLINK - Exception:
Traceback (most recent call last):
File "/Users/xxxxxxx/Library/Application Support/Kodi/addons/script.module.openscrapers/lib/openscrapers/sources_openscrapers/en_DebridOnly/myvideolink.py", line 107, in sources
posts = zip(client.parseDOM(r1, 'a', ret='href'), client.parseDOM(r1, 'a'), re.findall('((?:\d+.\d+|\d+,\d+|\d+)\s*(?:GB|GiB|MB|MiB))', r2[0]))
[2020-03-17 05:38:03] [COLOR red][ OPENSCRAPERS DEBUG ][/COLOR]: RAPIDMOVIEZ - Exception:
Traceback (most recent call last):
Cloudflare_reCaptcha_Provider: Cloudflare reCaptcha detected, unfortunately you haven't loaded an anti reCaptcha provider correctly via the 'recaptcha' parameter.
[2020-03-17 05:38:03] [COLOR red][ OPENSCRAPERS DEBUG ][/COLOR]: RAPIDMOVIEZ - Exception:
Traceback (most recent call last):
MissingSchema: Invalid URL 'None': No schema supplied. Perhaps you meant http://None?
[2020-03-17 05:39:10] [COLOR red][ OPENSCRAPERS DEBUG ][/COLOR]: RAPIDMOVIEZ - Exception:
Traceback (most recent call last):
Cloudflare_Loop_Protection: !!Loop Protection!! We have tried to solve 3 time(s) in a row.
[2020-03-17 05:39:10] [COLOR red][ OPENSCRAPERS DEBUG ][/COLOR]: RAPIDMOVIEZ - Exception:
Traceback (most recent call last):
Cloudflare_Loop_Protection: !!Loop Protection!! We have tried to solve 3 time(s) in a row.
[2020-03-17 05:39:10] [COLOR red][ OPENSCRAPERS DEBUG ][/COLOR]: RAPIDMOVIEZ - Exception:
Traceback (most recent call last):
Cloudflare_Loop_Protection: !!Loop Protection!! We have tried to solve 3 time(s) in a row.
[2020-03-17 05:39:10] [COLOR red][ OPENSCRAPERS DEBUG ][/COLOR]: RAPIDMOVIEZ - Exception:
Traceback (most recent call last):
Cloudflare_Loop_Protection: !!Loop Protection!! We have tried to solve 3 time(s) in a row.
[2020-03-17 05:39:14] [COLOR red][ OPENSCRAPERS DEBUG ][/COLOR]: RAPIDMOVIEZ - Exception:
Traceback (most recent call last):
Cloudflare_Loop_Protection: !!Loop Protection!! We have tried to solve 3 time(s) in a row.

Seems Rapidmoviez rarely passes CF...

German modules partially broken and incomplete

HD-Streams.org, and probably the rest, does not give 1080p results, and I also think the lower resolutions come from different pages.
In general I think they are largely outdated, and we would also benefit from modules for:

- streamkiste.tv
- kinoz.to

Proposal for next update

I'm thinking of adding the hash to our torrent sources dict in the next update. It would make things a little easier for devs doing torrent cached/uncached checking and/or removal.
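A hypothetical shape for such an entry (every field name here besides the proposed 'hash' key follows the common multi-scraper sources-dict convention; all values are invented for illustration, not the project's actual schema):

```python
# Illustrative torrent source entry with the proposed 'hash' key added.
example_source = {
    'source': 'torrent',
    'quality': '1080p',
    'language': 'en',
    'url': 'magnet:?xt=urn:btih:0000000000000000000000000000000000000000',
    'info': '2.1 GB',
    'direct': False,
    'debridonly': True,
    'hash': '0000000000000000000000000000000000000000',  # 40-char hex infohash
}

# With the hash exposed, devs can collect infohashes for a debrid
# cached/uncached lookup without parsing them out of magnet URLs:
hashes = [s['hash'] for s in [example_source] if s['source'] == 'torrent']
```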

PubFilmOnline

PubfilmOnline gives this error

UserWarning: No parser was explicitly specified, so I'm using the best available HTML parser for this system ("html.parser"). This usually isn't a problem, but if you run this code on another system, or in a different virtual environment, it may use a different parser and behave differently.

The code that caused this warning is on line 56 of the file C:\Users\Grim\Documents\GitHub\script.module.openscrapers\lib\openscrapers\sources_openscrapers\en\pubfilmonline.py. To get rid of this warning, pass the additional argument 'features="html.parser"' to the BeautifulSoup constructor.

Some scrapers not working in 0.0.2.003

Like RLSbb, SceneRLS, Zoogle (just to name a few).
If I roll back to the previous version, they work fine.

If you need a log, guide me.

Maybe you can replicate this issue also?

No result from "https://movietown.org" on branch "develop"

url: https://movietown.org

Using branch "develop", Kodi on a Fire TV Stick: the module "cfscrape.py" does NOT return a result.

Using branch "develop", Kodi on Windows: the module "cfscrape.py" does return a result.

Using branch "master": the same module "cfscrape.py" returns a result in Kodi on both the Fire TV Stick and Windows.

Sorry for the short text; my English is bad.
Thank you

url = https://movietown.org
import openscrapers
from openscrapers.modules import cfscrape
scraper = cfscrape.create_scraper()
sHtmlContent = scraper.get(url).content
print sHtmlContent

OpenScrapers v 0.0.1.109

Thanks for the update. Was this update meant to fix the Cloudflare/cfscrape error? For me, I'm still not getting any rlsbb links for some reason.
