3 Layers of troubleshooting
- Physical
- Power
- Dead ports/ cables
- Copper speed mismatch
- Fiber mode or wavelength mismatch
- Networking
- Firewall Rules
- Switch Configs
- Services
- Service status
- Configuration Files
Methodology
- Verify The Problem
- Determine which layer the problem exists at
- Rule out layers the problem could not be at
- Start with easiest issues that can be checked (power, cabling, service status)
- Step through the kits data flow
- Data is like water, the networks data has specific checkpoints it travels through before showing up in your SIEM.
Diagnosing the problem
-
Useful tools
- IPMI - Intelligent Platform Management Interface - Out of band server management
- Dell = iDRAC
- HP = iLO
- Supermicro = BMC
- Splunk btool
- ./splunk btool check
- ./splunk btool inputs list
- Elastic API
- Kibana - GET /_cluster/health
- Browser - https://master.dmssn.lan:9200/_cluster/health
- Curl - curl -X GET "master.dmssn.lan:9200/_cluster/health"
- More API's: https://www.elastic.co/guide/en/elasticsearch/reference/current/rest-apis.html
- IPMI - Intelligent Platform Management Interface - Out of band server management
-
Researching the Problem
- Support Pages
- Book of zeek: https://docs.zeek.org/en/master/
- Suricata docs: https://suricata.readthedocs.io/en/suricata-6.0.10/
- Zeek docs: https://readthedocs.org/projects/zeek-docs/
- Stack Overflow
- Splunkbase
-
Configuration Files
- Stop service if it's running
- Save old configuration file (e.x. file.conf.old) before changing configs
- Execute fix
- Restart service
- Verify fix