I’ve been drafting this post for a really long time. Like most posts, it’s largely for me to get some thoughts down. It’s also very related to the topic I’ll be talking about at Velocity later this year. When I gave a keynote talk at the Surge Conference last year, I talked about how our…
Talk from SREcon2016 by Brendan Gregg. Video: https://www.usenix.org/conference/srecon16/program/presentation/gregg . "There's limited time for performance ana…
lazyant, on "I usually run 'w' first when troubleshooting unkno...":
My go-to for initial troubleshooting of a server is:
uptime # uptime and load average (CPU stress)
w # or better yet: last | head # who is/has been in
netstat -tlpn # find server role
df -h # out of disk space?
grep -i kill /var/log/messages # out of memory?
ps auxf # what's running
htop # stressed? look out for D-state processes (typically waiting on I/O)
history # what has changed recently
tail /var/log/application.log # anything interesting logged?
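For anyone who wants to run the non-interactive parts of that checklist in one go, here is a rough sketch, not a definitive tool. The helper names `run_check` and `triage` are made up for illustration, and the 20-line cap on each step's output is an arbitrary choice. `htop` and `history` are left out because they only make sense in an interactive shell, and the application log is skipped because its path is site-specific. Steps whose tool is not installed are reported as skipped.

```shell
#!/bin/sh
# Hypothetical helper: run one checklist step, or report that the tool is missing.
# Usage: run_check "label" command [args...]
run_check() {
    label=$1
    shift
    printf '== %s ==\n' "$label"
    if command -v "$1" >/dev/null 2>&1; then
        # Cap the output so a single noisy step does not bury the others.
        "$@" 2>&1 | head -n 20
    else
        printf 'skipped: %s not found\n' "$1"
    fi
}

# Hypothetical wrapper: the non-interactive steps from the list above, in order.
triage() {
    run_check "uptime and load average" uptime
    run_check "who is logged in" w
    run_check "recent logins" last
    run_check "listening TCP sockets" netstat -tlpn
    run_check "disk space" df -h
    run_check "process tree" ps auxf
    # Log locations vary by distribution; this path is the one used in the list above.
    if [ -r /var/log/messages ]; then
        run_check "possible OOM kills" grep -i kill /var/log/messages
    fi
}

# Only run the checklist when asked to, so the file can also be sourced.
if [ "${1:-}" = "--run" ]; then
    triage
fi
```

Saved under a name of your choosing, it would be run as `sh triage.sh --run`, ideally as root so that `netstat -p` can show process names.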