netdata / netdata-cloud
The public repository of Netdata Cloud. Contribute with bug reports and feature requests.
License: GNU General Public License v3.0
Describe the bug
If I have the alarms and anomalies collectors enabled, I don't see the icons or info from dashboard_info.js in Cloud.
To Reproduce
Look at any agent that has alarms collector on it in netdata cloud
Expected behavior
The icon and info should be displayed as they are in the underlying agent dashboard.
Screenshots
Error logs
Desktop (please complete the following information):
Additional context
internal slack discussion: https://netdata-cloud.slack.com/archives/CHH8X9M5J/p1610627563003700
info from @jacekkolasa: We need to manually add dashboard_info.js to the Dashboard's bundle for the cloud. The icons are there, it's just a matter of that outdated file. Also the header info is missing, under the title.
Describe the bug
I have a room with one node. This node has about 20 anomalies collector jobs running on it, and so has lots of anomalies contexts in the menu on the right.
When i am in the node itself i see correct menus:
But when I go to the overview for the room I lose those menus, and all the "System Overview" sections have been flattened out into their own menus.
To Reproduce
Create an agent with a lot of non-standard collectors and/or multiple jobs per collector. Change the priority of those jobs to 80-90 so that they appear at the top of the menu list.
Create a space and room with just that node. Look at the differences between the overview and the node view of the dashboard itself.
Expected behavior
I should see all the menu sections in the overview screen, just as I do on the node screen.
Screenshots
As above
Additional context
I'm happy to invite anyone to my room if easier to debug that way as might be a tricky one to recreate.
Describe the bug
If I create a new room and then select a date-time range that starts before the room was created, I see strange date axis behaviour.
To Reproduce
Create a new room, go into the room, and press "last 12 hours". You should see a longer time frame on the date axis than 12 hours.
Expected behavior
I expect to see last 12 hours
Screenshots
I just created a new room called "Redis" that has all the gke-production-redis nodes in it. Then I pressed "last 12 hours", and the charts are displayed in a strange way that does not show the last 12 hours. It looks like it is trying to go much further back in time.
If I go to the General room it looks OK.
But any of the 3 new rooms I made seems to have this issue:
So it seems that the datetime picker misbehaves only for new rooms.
E.g. "last 30 minutes" seems OK on the load chart but not on the cpu chart.
It's like it is doing the math wrong and looking back further.
Last 2 hours:
It's like the blank space outside the range I'm asking for grows with the size of the window from the datetime picker.
It's almost as if I get this issue whenever I request a time range that starts before the room was created.
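The expectation in this report can be stated as simple arithmetic. The sketch below assumes the window is expressed as a negative `after` offset in seconds, relative to the end of the window; the function name, the `before` parameter and both timestamps are hypothetical. It only illustrates that a "last 12 hours" window should stay 12 hours long, even when it starts before the room was created.

```python
def expected_window(now: int, after: int, before: int = 0) -> tuple[int, int]:
    """Return the absolute (start, end) of a chart window in epoch seconds.

    Assumption: a non-positive `before` is relative to `now`, and a negative
    `after` is relative to the resolved end of the window.
    """
    end = now + before if before <= 0 else before
    start = end + after if after < 0 else after
    return start, end


now = 1_613_650_000           # hypothetical "current" time
room_created = now - 30 * 60  # hypothetical: room created 30 minutes ago

start, end = expected_window(now, after=-12 * 3600)
print(end - start)            # length of the window in seconds
print(start < room_created)   # whether the window starts before the room existed
```

Under these assumptions the window length depends only on `after`, so the room creation time should not change what the date axis shows.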
Describe the bug
When I build a new dashboard in Netdata Cloud, the charts seem to auto-resize on hover, which makes interacting with them quite difficult, and then reduces their height, which gives less precise metrics. I only see this happen in Firefox.
Describe the bug
The War Room alarm bell does not show the War Room active alarm state.
To Reproduce
Steps to reproduce the behavior: Within the War Room with active alarms, see the alarm bell
Expected behavior
If the War Room has active alarms, the bell icon should be showing the War Room alarm status with a yellow and a red dot.
Describe the bug
In Overview, the sane defaults are not kept when the grouping is changed.
To Reproduce
Steps to reproduce the behavior: In Overview -> click "by node" on a chart -> then select a single dimension/node from the bottom of the chart -> switch to "by dimension". Sometimes the chart is empty.
Expected behavior
The default configuration for the selected grouping needs to be applied each time
Describe the bug
On some composite charts the horizontal axis jumps by minutes and the charts change shape, so users cannot follow them. The resolution also changes.
To Reproduce
Steps to reproduce the behavior: Go to Overview -> Disks -> disk.io chart
Expected behavior
The horizontal axis should maintain a set timeframe specified by the date & time picker
Screenshots
See video
Screen Recording 2020-12-03 at 4.38.08 PM.mov.zip
Describe the bug
During user onboarding, after creating the first space and room the user is guided to a "Claim your first nodes in room " tab.
There are currently two issues with this tab:
a) The claim command should include two rooms (General and the one the user has just created)
b) The user should be able to remove the General room from the Rooms tab, but currently cannot remove it (even if the claim command gets updated, the UI still shows the General room in the bar).
To Reproduce
Follow onboarding for a new user in cloud.
Expected behavior
The user should be able to remove general room from the bar & see two rooms in the claim command shown initially.
Describe the bug
A clear and concise description of what the bug is.
To Reproduce
Steps to reproduce the behavior:
Login to app.netdata.cloud and click on more in the sidepanel
Expected behavior
The button should be clickable either by clicking on the space at the right or on the text. The rest of the options function that way.
Describe the bug
Sometimes alarms are incorrectly shown as active while on the agent itself the alarm is cleared. Most of the time these alarms were initially triggered days before.
To Reproduce
Steps to reproduce the behavior: Not always reproducible
Expected behavior
Active alarms for nodes on Netdata Cloud should be identical to the ones on the respective connected agents.
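One way to check this expectation is to diff the set of active alarms on each side. The sketch below is only an illustration: it assumes both sides can be reduced to a mapping of alarm name to an object with a `status` field, and the function names and sample data are hypothetical.

```python
def active_alarms(alarms: dict) -> set[str]:
    """Names of alarms whose status is WARNING or CRITICAL."""
    return {
        name
        for name, alarm in alarms.items()
        if alarm.get("status") in {"WARNING", "CRITICAL"}
    }


def diff_active_alarms(agent: dict, cloud: dict) -> dict:
    """Report alarms that are active on one side only."""
    on_agent, on_cloud = active_alarms(agent), active_alarms(cloud)
    return {
        "only_on_cloud": sorted(on_cloud - on_agent),  # possibly stale in Cloud
        "only_on_agent": sorted(on_agent - on_cloud),  # not shown in Cloud
    }


# Hypothetical sample data, not taken from a real node.
agent_alarms = {"example_chart.example_alarm": {"status": "CLEAR"}}
cloud_alarms = {"example_chart.example_alarm": {"status": "WARNING"}}

print(diff_active_alarms(agent_alarms, cloud_alarms))
```

An empty result on both keys would mean that Cloud and the agent agree on which alarms are active.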
Describe the bug
When accessing the same node through Cloud or directly from the Agent's Dashboard, first_entry is different. The difference varies: sometimes it's just 1 second, sometimes more than 2 hours.
I'm not sure whether this bug is specific to our test infrastructure.
Expected behavior
We should see the same data/metadata from Agent Dashboard and Cloud's Node-view
Screenshots
https://staging.netdata.cloud/api/v1/nodes/226fdb02-d510-43e0-82b7-ba0a00440252/data?chart=system.io&_=1613650295832&format=json&points=90&group=average&gtime=0&options=ms%7Cflip%7Cjsonwrap%7Cnonzero&after=-900
https://netdata.corp.staging.netdata.cloud/host/gke-staging-main-3dce75e9-8ou7/api/v1/data?chart=system.io&_=1613650295832&format=json&points=90&group=average&gtime=0&options=ms%7Cflip%7Cjsonwrap%7Cnonzero&after=-900
Notice that latest_values match, so it looks like it's the same node. All query params in the URLs are the same.
Are we sure those are the same nodes?
The version/history/memory_mode are different, but the hostname matches.
Also, I've noticed that the ids of those nodes are changing in the Cloud: after some time, the node seems to have restarted. The same hostname now has a new id, history is cleared, and the first_entry matches or differs by just 1 second. After some time (an hour) it desynchronises again.
Related to #47
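The comparison described in this report can be sketched as a small helper that lists which metadata fields differ between the two responses. The field names `first_entry` and `latest_values` come from the report; the function name and the sample values are hypothetical and heavily trimmed.

```python
def metadata_mismatches(agent: dict, cloud: dict, keys: tuple[str, ...]) -> dict:
    """Return {key: (agent_value, cloud_value)} for every key that differs."""
    return {
        key: (agent.get(key), cloud.get(key))
        for key in keys
        if agent.get(key) != cloud.get(key)
    }


# Hypothetical, trimmed-down responses; the values are made up.
agent_response = {"first_entry": 1613640000, "latest_values": [1.5, 0.0]}
cloud_response = {"first_entry": 1613647200, "latest_values": [1.5, 0.0]}

mismatches = metadata_mismatches(
    agent_response, cloud_response, ("first_entry", "latest_values")
)
for key, (agent_value, cloud_value) in mismatches.items():
    print(f"{key}: agent={agent_value} cloud={cloud_value}")
```

With this sample data only `first_entry` is reported, which mirrors the situation in the report: the latest values match while the first entry does not.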
Describe the bug
Changing the time period using explicit timestamps, rather than the "Show X latest minutes/hours etc" option, appears to have no actual effect. The time reverts to its original values when clicking on anything else.
To Reproduce
Check video.
Expected behavior
Accepting a custom time period for getting chart data.
Screenshots
Video attached.
https://user-images.githubusercontent.com/2836342/108420984-31435580-723d-11eb-8a3c-fef486ab229c.mp4
Error logs
"API is not enabled in project settings" appears in the dev console.
Desktop (please complete the following information):
Describe the bug
System overview, node view, overview screens don't resize when we reduce the available width.
To Reproduce
Make the window smaller, or hit F12 with the console appearing on the right side.
Expected behavior
Resize the main contents of the screen
Error logs
Desktop (please complete the following information):
Describe the bug
I cannot save dashboards that have charts provided by 3rd party collectors.
To Reproduce
Steps to reproduce the behavior:
Expected behavior
I expected it to save and persist between reloads.
Screenshots
As well as the screenshot attached below for the error that is thrown, I also recorded and uploaded this video reproducing the issue.
https://www.youtube.com/watch?v=u1t57t8tg-Q&feature=youtu.be
Desktop (please complete the following information):
Additional context
It worked well for around 8-9 days until yesterday. I haven't done anything to netdata in that time.
I tried restarting the netdata service to see if that fixed the issue, but it did not.
This happens in all War Rooms and dashboards, creating a new one doesn't fix the issue.
I tried on an incognito window (without extensions) and the issue still happens.
Describe the bug
Under Alarm Info, the Calculation or the DB lookup is sometimes missing.
All active alarms should show either the Calculation or DB lookup.
It might well be that this happens because some alarms have DB lookup and not Calculation.
To Reproduce
Steps to reproduce the behavior: Go to an active alarm -> click on the 3-dot menu -> click on Alarm Info
Expected behavior
All active alarms should show either the Calculation or DB lookup
Screenshots
In this image the Critical value is missing as well, but I cannot confirm it is a bug, as I have no access to the agent dashboard or configuration.
Describe the bug
Sometimes in Cloud the reachability status is not correct:
To Reproduce
Steps to reproduce the behavior: No clear steps to reproduce
Expected behavior
The reachability status of the nodes should always be correct
Screenshots
Describe the bug
Alarm is showing for lowest_entropy in Netdata cloud but there is no alarm on the local node's web UI and there should be no alarm since I have disabled the entropy plugin.
The alarm is now 5 hours old.
To Reproduce
Agent v1.26.0-222-nightly
Trigger alarm for lowest_entropy.
Set "/proc/sys/kernel/random/entropy_avail = no" in "/etc/netdata/netdata.conf"
Restart agent.
Optionally: delete "/etc/netdata/health.d/entropy.conf"
Expected behavior
I expect the alarm status to:
a. be the same on local node web UI and cloud
b. For the cloud UI to clear the alarm
Screenshots
Error logs
Logs included but am not sure how much use they'll be :)
I've generated a HAR file but don't fancy sharing that on a public tracker. Let me know if you think it'll help and I can ping it over to the support email address.
Desktop (please complete the following information):
Additional context
I have tried Google Chrome as well as Edge Chromium, but the issue occurs in both.
I tried service restarts and even a full system reboot, but the alarm still persists.
app.netdata.cloud-1605907164843.log
Describe the bug
I receive a lot of emails because my node is unreachable and reachable again within minutes.
To Reproduce
Expected behavior
Node always reachable except when I stop the container.
Screenshots
Additional details
netdata.conf
[global]
update every = 3
hostname = THOR
Describe the bug
On Cloud's Nodes table there is the wrong value ("Bare Metal") for an LXC container.
To Reproduce
Steps to reproduce the behavior:
See https://community.netdata.cloud/t/best-way-to-setup-netdata-in-a-proxmox-host-with-some-lxc-containers/81
Expected behavior
When a Netdata agent is installed in an LXC container, it should say LXC, not bare metal.
Screenshots
See https://community.netdata.cloud/t/best-way-to-setup-netdata-in-a-proxmox-host-with-some-lxc-containers/81
Describe the bug
Node page not loading
To Reproduce
Click on a node detail like https://app.netdata.cloud/spaces/production-iqdkpt4/rooms/general/nodes/483ee1be-3cbb-47a0-a913-7d00facd5c96
Expected behavior
Page should render
Error logs
[Error] TypeError: undefined is not an object (evaluating 'Object(o.a)(r,n)')
hasOwnProperty
(anonymous function) — main-344126e7afd18628ea00.js:199:1483564
(anonymous function) — main-344126e7afd18628ea00.js:236:1708661
(anonymous function) — main-344126e7afd18628ea00.js:199:1454311
(anonymous function) — main-344126e7afd18628ea00.js:236:1708668
(anonymous function) — commonmainnetdata_dashboard.e5eb42184e81835fbe6b.js:1:13762
v — commonmainnetdata_dashboard.e5eb42184e81835fbe6b.js:1:11769
(anonymous function) — main-344126e7afd18628ea00.js:199:1619585
(anonymous function) — main-344126e7afd18628ea00.js:199:1612626
f — main-344126e7afd18628ea00.js:199:1610279
y — main-344126e7afd18628ea00.js:199:1610468
p — main-344126e7afd18628ea00.js:199:1610328
(anonymous function) — main-344126e7afd18628ea00.js:199:1612581
(anonymous function) — main-344126e7afd18628ea00.js:199:1617305
m — main-344126e7afd18628ea00.js:199:1618263
p — main-344126e7afd18628ea00.js:199:1617774
c — main-344126e7afd18628ea00.js:199:1618046
promiseReactionJob
(anonymous function) (main-344126e7afd18628ea00.js:408:3552814)
y (main-344126e7afd18628ea00.js:2:233276)
y (main-344126e7afd18628ea00.js:199:1616469)
i (main-344126e7afd18628ea00.js:199:1615983)
(anonymous function) (main-344126e7afd18628ea00.js:199:1616070)
p (main-344126e7afd18628ea00.js:199:1617838)
c (main-344126e7afd18628ea00.js:199:1618046)
n (main-344126e7afd18628ea00.js:2:233931)
c (main-344126e7afd18628ea00.js:199:1618046)
y (main-344126e7afd18628ea00.js:199:1616568)
i (main-344126e7afd18628ea00.js:199:1615983)
(anonymous function) (main-344126e7afd18628ea00.js:199:1616070)
y (main-344126e7afd18628ea00.js:199:1616568)
i (main-344126e7afd18628ea00.js:199:1615983)
(anonymous function) (main-344126e7afd18628ea00.js:199:1616070)
y (main-344126e7afd18628ea00.js:199:1616568)
i (main-344126e7afd18628ea00.js:199:1615983)
(anonymous function) (main-344126e7afd18628ea00.js:199:1616070)
p (main-344126e7afd18628ea00.js:199:1617838)
c (main-344126e7afd18628ea00.js:199:1618046)
(anonymous function) (main-344126e7afd18628ea00.js:199:1612652)
f (main-344126e7afd18628ea00.js:199:1610279)
y (main-344126e7afd18628ea00.js:199:1610468)
p (main-344126e7afd18628ea00.js:199:1610328)
(anonymous function) (main-344126e7afd18628ea00.js:199:1612581)
(anonymous function) (main-344126e7afd18628ea00.js:199:1617305)
m (main-344126e7afd18628ea00.js:199:1618263)
p (main-344126e7afd18628ea00.js:199:1617774)
c (main-344126e7afd18628ea00.js:199:1618046)
promiseReactionJob
[Error] TypeError: undefined is not an object (evaluating 'Object(o.a)(r,n)')
(anonymous function) (main-344126e7afd18628ea00.js:408:3552814)
y (main-344126e7afd18628ea00.js:2:233276)
y (main-344126e7afd18628ea00.js:199:1616469)
i (main-344126e7afd18628ea00.js:199:1615983)
(anonymous function) (main-344126e7afd18628ea00.js:199:1616070)
p (main-344126e7afd18628ea00.js:199:1617838)
c (main-344126e7afd18628ea00.js:199:1618046)
n (main-344126e7afd18628ea00.js:2:233931)
c (main-344126e7afd18628ea00.js:199:1618046)
y (main-344126e7afd18628ea00.js:199:1616568)
i (main-344126e7afd18628ea00.js:199:1615983)
(anonymous function) (main-344126e7afd18628ea00.js:199:1616070)
y (main-344126e7afd18628ea00.js:199:1616568)
i (main-344126e7afd18628ea00.js:199:1615983)
(anonymous function) (main-344126e7afd18628ea00.js:199:1616070)
y (main-344126e7afd18628ea00.js:199:1616568)
i (main-344126e7afd18628ea00.js:199:1615983)
(anonymous function) (main-344126e7afd18628ea00.js:199:1616070)
p (main-344126e7afd18628ea00.js:199:1617838)
c (main-344126e7afd18628ea00.js:199:1618046)
(anonymous function) (main-344126e7afd18628ea00.js:199:1612652)
f (main-344126e7afd18628ea00.js:199:1610279)
y (main-344126e7afd18628ea00.js:199:1610468)
p (main-344126e7afd18628ea00.js:199:1610328)
(anonymous function) (main-344126e7afd18628ea00.js:199:1612581)
(anonymous function) (main-344126e7afd18628ea00.js:199:1617305)
m (main-344126e7afd18628ea00.js:199:1618263)
p (main-344126e7afd18628ea00.js:199:1617774)
c (main-344126e7afd18628ea00.js:199:1618046)
promiseReactionJob
Desktop (please complete the following information):
We use the node name as the only identifier shown on multiple screens. Of particular importance is the room nodes list, which we would normally use to remove nodes we don't want in the room.
If we continue to allow duplicate node names, we need to provide an additional identifier that allows users to distinguish between multiple entries with the same name, so they can at least remove from the room the ones that are problematic.
Describe the bug
A pinned & minimized dashboard does not get updated when charts are added from views other than the dashboard itself.
To Reproduce
Steps to reproduce the behavior: Go to a single node and add a chart to a dashboard while the dashboard is pinned & minimized. The dashboard does not get updated.
Expected behavior
Pinned & minimized dashboards should be automatically updated after addition of a chart from outside the dashboard.
Describe the bug
To Reproduce
Steps to reproduce the behavior: Drag a card out of the dashboard. https://www.youtube.com/watch?v=VtAZu_sFoQ4&feature=youtu.be&ab_channel=max232
Expected behavior
The user should still be able to see a card while it is dragged outside the dashboard.
It does not matter whether I add the cpu chart before the bandwidth chart; the cpu.cpu# charts are always wiped when the save button is hit.
I have deleted the dashboard and recreated it multiple times.
We have a few problems with the groupings shown above in the overview screen:
Describe the bug
I have an undismissable empty alert on my Netdata Cloud dashboard. I believe it is from a collector I disabled (powersupply_capacity.battery). It does not appear on the node's local dashboard.
To Reproduce
- Set up a new Netdata agent
- Immediately disable a collector that will always trigger an alert
- Claim node in Netdata Cloud
- View Netdata Cloud dashboard
Expected behavior
No (empty) alerts for disabled collectors
Describe the bug
When attempting to log in to Netdata Cloud using a GitHub Account when you have not already linked a GitHub Account to your Netdata Cloud account, canceling the authentication flow from the GitHub side results in a generic 404 error page in Netdata Cloud.
To Reproduce
Expected behavior
The user should be presented with a dedicated error page for this situation with a link to return to the login page.
Screenshots
n/a
Error logs
https://aurek.ahferroin7.net/pub/app.netdata.cloud.har
Desktop:
Also confirmed on Microsoft Edge 87.0.664.75 on the same system.
Describe the bug
I zoomed into an "overview" chart that was grouped "by node" and the chart stopped working due to invalid data calls.
To Reproduce
Zoom In/Out and see if "after" is always an integer, or if there's a chance of it becoming a float.
Expected behavior
Zooming in/out doesn't break the chart
Screenshots
Error logs
{"errorMsgKey":"ErrParsingRequestBody","errorMessage":"json: cannot unmarshal number -361.6898148148149 into Go struct field dataRequestBody.after of type int64","errorCode":"lk20X34j9o-061919"}
Desktop (please complete the following information):
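The error message above suggests that the backend decodes `after` into an int64, so a fractional value produced while zooming is rejected. A minimal client-side sketch of one possible fix follows, assuming the request body is plain JSON; the function name and the `before` field are hypothetical, and only `after` appears in the original error.

```python
import json
import math


def build_data_request(after: float, before: float = 0) -> str:
    """Serialise a data request body with integer `after`/`before` values.

    Assumption: the backend expects whole seconds, as the int64 in the error
    message suggests. Flooring `after` and ceiling `before` widens the window
    slightly instead of cutting off part of the selection.
    """
    body = {"after": math.floor(after), "before": math.ceil(before)}
    return json.dumps(body)


# The fractional value from the error message above.
print(json.dumps({"after": -361.6898148148149}))  # what a naive client sends
print(build_data_request(-361.6898148148149))     # integer fields only
```

Rounding towards a wider window is a design choice; truncating or rounding to nearest would also yield integers, but could clip the edge of the zoomed selection.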
Describe the bug
Agent user is signed-in but the Spaces and War Rooms are missing from the left bar and panel
To Reproduce
Steps to reproduce the behavior: Not always reproducible
Expected behavior
The user is expected to see the left bar and panel populated with the available Spaces and War Rooms after being signed-in
Describe the bug
On nodes table
To Reproduce
Steps to reproduce the behavior: Go to "Nodes table" -> Remove an instance or family(software or hardware resources) from a claimed agent -> Restart the agent -> See the sparklines in the node view
Expected behavior
Deleted software or hardware resources should be removed from Cloud if they did not exist within the selected timeframe.
Describe the bug
When running metric correlations sometimes it takes a very long time to get the results. Some other times requests time out.
To Reproduce
Steps to reproduce the behavior: Go to a single node view -> Click "Metric Correlations" -> Select an area of interest in a chart -> Click "Find Correlations". Not always reproducible
Expected behavior
Metric correlations should show results in less than 15 seconds.
Cc @dim08
Describe the bug
In the response of the /charts endpoint in the room overview screen there are 2 keys, cpufreq.cpufre and cpuidle.cpuidle, that are displayed in the sidebar as primary menu options.
To Reproduce
Most of the production spaces in the room overview page contain this response.
Expected behavior
They should be prefixed by the cpu. key, so that they are grouped under the cpu primary menu. Something like cpu.freq
Screenshots
Error logs
Part of the /charts response
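To illustrate why the prefix matters, the sketch below groups chart keys by the text before the first dot. That the sidebar derives its primary menu this way is an assumption made for illustration; the function name is hypothetical, and `cpu.idle` is a made-up rename in the spirit of the `cpu.freq` suggestion.

```python
from collections import defaultdict


def group_by_menu(chart_ids: list[str]) -> dict[str, list[str]]:
    """Group chart ids by the text before the first dot.

    Assumption: the sidebar derives its primary menu from that prefix.
    """
    menus: dict[str, list[str]] = defaultdict(list)
    for chart_id in chart_ids:
        menu, _, _ = chart_id.partition(".")
        menus[menu].append(chart_id)
    return dict(menus)


# Current key from the report next to a regular cpu chart.
print(group_by_menu(["cpu.cpu", "cpuidle.cpuidle"]))
# Hypothetical renamed key, following the "cpu." prefix suggestion.
print(group_by_menu(["cpu.cpu", "cpu.idle"]))
```

Under that assumption the first call yields two separate menus, while the second keeps everything under a single `cpu` menu.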
Describe the bug
On the single agent view I updated the time picker to the last 6 hours. The charts were updated, but the control continues to show 15 minutes. I tried it several times with different time selections, with the same result.
To Reproduce
Change the time window to anything other than 15min
Expected behavior
The time picker shows the selected time window, not 15min
Screenshots
Error logs
No console errors shown
Desktop (please complete the following information):
Sometimes, alarms don't reset in the cloud, even if we restart the agent. Here you can see the effect with the dashboard in cloud:
At the same time, the alarms grabbed through the CLI are empty, which is the correct situation:
Please note the age of the alarms displayed in the cloud; they seem to last forever.
Describe the bug
I see an alarm on netdata but when I try to click on it from the numbered link in the war room list of nodes the browser returns a blank page. When I access it from the active alarms list, only the node and raise date are reported (no other information) and if I view alarm info they are empty. On the agent console no alarm is reported. It seems the synch between agent and cloud information is "broken".
To Reproduce
I have no idea how this issue occurred.
Expected behavior
I would expect either detailed alarm info or no alarm and the same info in cloud and agent.
Screenshots
Link to screenshots: https://www.dropbox.com/s/iykalzaf63ec70v/Screenshot%202021-01-15%20181914.zip?dl=0
Error logs
Access to XMLHttpRequest at 'https://www.google-analytics.com/j/collect?v=1&_v=j87&a=1067901606&t=pageview&_s=1&dl=https%3A%2F%2Fapp.netdata.cloud%2Fspaces%2Figroup%2Frooms%2Fseregno%2Fnodes&ul=en-gb&de=UTF-8&dt=Netdata%20Cloud&sd=24-bit&sr=1920x1080&vp=1920x970&je=0&_u=QACAAEABAAAAAC~&jid=284583430&gjid=2177631&cid=1399776216.1604608692&tid=UA-64295674-3&_gid=1574353512.1610433909&_r=1&gtm=2wg161N6CBMJD&z=995482689' from origin 'https://app.netdata.cloud' has been blocked by CORS policy: Response to preflight request doesn't pass access control check: The value of the 'Access-Control-Allow-Origin' header in the response must not be the wildcard '*' when the request's credentials mode is 'include'. The credentials mode of requests initiated by the XMLHttpRequest is controlled by the withCredentials attribute.
Desktop (please complete the following information):