mirkoschubert / gdpr-cli Goto Github PK
View Code? Open in Web Editor NEWA command line tool for checking your website for GDPR compliance.
License: MIT License
A command line tool for checking your website for GDPR compliance.
License: MIT License
Describe the bug
--fonts
does not initialize the JSParser as it needs it
To Reproduce
Steps to reproduce the behavior:
$ node index.js scan example.com -f
fonts
[21:04:54] Checking the URL ...
[21:04:54] Loading HTML file
[21:04:54] Setup additional content
[21:04:54] Loading CSS files
GENERAL INFORMATION
Title: Example Domain
Description:
URL: http://example.com/
Software: Unknown
Cannot read property 'hasTypekitJS' of undefined
Expected behavior
fonts should be detected
Screenshots
Desktop:
Additional context
I saw that you mentioned my reserved package name ;-)
Due to naming conventions in npmjs.com we changed our name from gdpr-check to gdpr-cli
Are still interested in using gdpr-check
? I would be also interested in providing further checks, solutions and contribute to the project.
Unfortunately I did not get any email or contact request asking about the usage of gdpr-check
. In general everyone can contact me and ask about unused package names and so on, I do not bite ;-)
I had the idea to create a similar tool but use puppeteer.
Puppeteer has a rich API for getting information about cookies, all requests, emulate languages, devices and much more.
The DNS checks and others run multiple regex match tests.
The check should be done after the first hit and return
in this case or continue
.
See getPrefetchingInformation
http://jsben.ch/1wwBN
https://jsperf.com/substring-search2
const Benchmark = require('benchmark')
const suite = new Benchmark.Suite
suite.add('match', {
setup: function () {
var dns = [
"23423423",
"gravatar.com",
"asdasdsd",
"34534",
"67867"
]
},
fn: function () {
dns.forEach((u, i) => {
let explanation;
if (u.match(/fonts\.googleapis\.com/) !== null) explanation = 'Google Fonts';
if (u.match(/gravatar\.com/) !== null) explanation = 'asdasd';
});
}
})
.add('indexOf', {
setup: function () {
var dns = [
"23423423",
"gravatar.com",
"asdasdsd",
"34534",
"67867"
]
},
fn: function () {
dns.forEach((u, i) => {
let explanation;
if (u.indexOf('fonts.googleapis.com') >= 0) explanation = 'Google Fonts';
if (u.indexOf('gravatar.com') >= 0) explanation = 'asdasd';
});
}
})
.add('includes', {
setup: function () {
var dns = [
"23423423",
"gravatar.com",
"asdasdsd",
"34534",
"67867"
]
},
fn: function () {
dns.forEach((u, i) => {
let explanation;
if (u.includes('fonts.googleapis.com')) explanation = 'Google Fonts';
if (u.includes('gravatar.com')) explanation = 'asdasd';
});
}
})
.on('cycle', function (event) {
console.log(String(event.target));
})
.on('complete', function () {
console.log('Fastest is ' + this.filter('fastest').map('name'))
})
.run({
'async': false
})
match x 3,490,973 ops/sec ±0.40% (96 runs sampled)
indexOf x 77,547,624 ops/sec ±1.18% (93 runs sampled)
includes x 11,250,688 ops/sec ±1.15% (90 runs sampled)
Fastest is indexOf
We need to analyse websites against embedded videos usage
I have already implemented the needed feature
cf. https://github.com/betaglop/gdpr-cli/tree/dev
Additional context
I cannot propose a pull request as other pull requests are pending and needed for the patch I can propose...
Hi,
Wenn ich eine Seite scannen möchte bekomme ich jedesmal:
Checking the URL ...
URL is not defined
Die Seiten die ich scanne gibt es natürlich.
Describe the bug
Scan fails with SyntaxError: Unexpected token ...
To Reproduce
Steps to reproduce the behavior:
Expected behavior
gdpr-cli scans www.example.com
Desktop:
Additional context
Stacktrace:
`$ gdpr scan www.example.com
/usr/local/lib/node_modules/gdpr-cli/lib/ui.js:59
cui.div(...data);
^^^
SyntaxError: Unexpected token ...
at exports.runInThisContext (vm.js:53:16)
at Module._compile (module.js:374:25)
at Object.Module._extensions..js (module.js:417:10)
at Module.load (module.js:344:32)
at Function.Module._load (module.js:301:12)
at Module.require (module.js:354:17)
at require (internal/module.js:12:17)
at Object. (/usr/local/lib/node_modules/gdpr-cli/lib/tasks.js:12:12)
at Module._compile (module.js:410:26)
at Object.Module._extensions..js (module.js:417:10)
`
I'd like to automate Cookies detection and GDPR compliance evaluation (record-keeping, eg. 13 month for Cookies in France).
I'd like to get back informations about cookies and cookies record-keeping policies on the targeted website. It's easy to do that using bash (I've already written something that way), so I'd like to see it in this tool, maybe by contributing...
Eg. using bash:
FILE=/path/to/cookies/file
wget --save-cookies $FILE [URL]
MONTH13=34164000 # nb of seconds in 13 month
for epoch in `cat $FILE | grep -v '^#' | grep -v '^$' | cut -d" " -f5`
do
if [ $epoch -gt `echo \`date +%s\` + $MONTH13 | bc` ]
then
DATE=`date --date=@$epoch`
echo "- KO: $DATE"
continue;
fi
echo -n "- OK: "
date --date=@$epoch
done
Describe the bug
Scanning sites utilizing the Simple Google Analytics plugin return - No Analytics Tool has been found.
Reproduced on multiple sites
Below is the code with the analytics at the bottom of the page
-- | <script type="text/javascript">panopress.imagebox();</script> | | <script type="text/javascript"> | var _gaq = _gaq \|\| []; | _gaq.push(['_setAccount','UA-123456-7']); | _gaq.push(['_trackPageview']); | (function() { | var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true; | ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + '.google-analytics.com/ga.js'; | var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s); | })(); | </script> | | <script type='text/javascript' src='http://www.site.com/wp-includes/js/wp-embed.min.js?ver=4.9.6'></script> |
Plugin; https://wordpress.org/plugins/simple-google-analytics/
Is your feature request related to a problem? Please describe.
Your tool should be able to detect CDN connection.
Cloudflare is used by many non-professional users for free SSL and cache.
However, Cloudflare itself is very dangerous, government funded MITMed point.
Therefore your tool should notify the user - "Don't forget to write 'We also share your IP and posted data with Cloudflare'!."
Describe the solution you'd like
Detect any CDN connection.
Describe alternatives you've considered
Suggest non-CDN solution. Let's Encrypt for certificate, for example.
Additional context
https://trac.torproject.org/projects/tor/ticket/24351
https://trac.torproject.org/projects/tor/ticket/18361
Is your feature request related to a problem? Please describe.
Describe the solution you'd like
A more functional programming approach for better maintainability and readability.
Describe alternatives you've considered
Some ideas / modules which might be useful here:
https://github.com/DanielRuf/https-checks/blob/master/index.js
https://github.com/DanielRuf/symantecpkicerts/blob/master/index.js
https://github.com/DanielRuf/website-stack/blob/master/index.js
https://github.com/DanielRuf/website-checks/blob/master/index.js
Additional context
Describe the bug
Google Analytics is not detected
To Reproduce
Steps to reproduce the behavior:
Expected behavior
Google Analytics is detected
Desktop:
Additional context
ANALYTICS
No Analytics Tool has been found.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.