Locked History Actions

Diff for "ilHadoopStatus"

Differences between revisions 1 and 26 (spanning 25 versions)
Revision 1 as of 2014-12-04 02:57:21
Size: 1219
Editor: akrevl
Comment:
Revision 26 as of 2015-05-05 22:42:22
Size: 2095
Editor: akrevl
Comment:
Deletions are marked like this. Additions are marked like this.
Line 2: Line 2:

##{{{#!wiki warning
##'''Offline'''
##
##The cluster is currently not operational. See notes below.
##}}}

##{{{#!wiki note
##'''Monitoring'''
##
##Ganglia monitoring is currently having some issues.
##}}}

==== Cluster status ====

|| 2015-05-05 15:40:00 PDT || Cluster is online. Problem fixed. ||
|| 2015-05-05 10:05:00 PDT || Cluster is online but jobs are not getting processed. Looking into the issue. ||
|| 2015-04-30 11:40:00 PDT || Cluster is online. Upgraded to CDH 5.4.0 and JDK 8u45. ||
|| 2015-04-29 16:30:00 PDT || Cluster is offline due to package upgrades (moving to CDH 5.4.0). ||
|| 2015-04-09 06:42:00 PDT || Cluster is online. ||
|| 2015-04-08 06:00:00 PDT || Cluster is offline due to NameNode issues. Investigating + trying to repair the problem. ||
|| 2014-12-05 22:32:00 PST || Cluster is online. ||
|| 2014-12-04 21:05:00 PST || Maintenance in progress. Installing the secondary namenode, Spark, etc. ||
|| 2014-12-04 06:28:00 PST || Ganglia back to normal. Cluster is online. ||
|| 2014-12-04 05:40:00 PST || Ganglia reporting is currently having issues. ||
|| 2014-12-04 05:18:00 PST || Cluster is back online. ||
|| 2014-12-03 19:15:00 PST || Cluster offline for system updates. Should be back online by --(22:00 PST)-- 2014-12-04 02:00 PST. ||

==== Cluster usage statistics ====

|| '''CPU load''' <<BR>> [%] || '''RAM used''' <<BR>> [bytes] ||
|| <<Ganglia(hadoop_cpu,ilHadoop,)>> || <<Ganglia(hadoop_mem,ilHadoop,)>> ||
|| '''NETWORK traffic''' <<Color2(IN,green,fontWeight=bold)>> <<Color2(OUT,blue,fontWeight=bold)>> <<BR>> [bytes/s] || '''DISK read''' <<BR>> [bytes/s] || '''DISK write''' <<BR>> [bytes/s] ||
|| <<Ganglia(hadoop_network,ilHadoop,)>> || <<Ganglia(hadoop_disk,ilHadoop,,read,sda*sdb*sdc*sdd)>> || <<Ganglia(hadoop_disk,ilHadoop,,write,sda*sdb*sdc*sdd)>> ||
Line 5: Line 39:
 * [[ilHadoopStats|Hadoop stats]]  * [[ilHadoopStats|Hadoop Cluster Statistics]]
Line 8: Line 42:

==== Current usage ====

||<v:rowspan=2> '''Server''' ||<v:rowspan=2> '''OS''' ||<v:rowspan=2> '''MEMORY''' <<BR>>[GB] ||<v:colspan=6> '''CPU'''||<v:rowspan=2> '''Storage''' <<BR>> [TB] ||<v:rowspan=2> '''Login''' ||<v:rowspan=2> '''CPU load''' <<BR>> [%] ||<v:rowspan=2> '''RAM used''' <<BR>> [bytes] ||<v:rowspan=2> '''NETWORK traffic''' <<Color2(IN,green,fontWeight=bold)>> <<Color2(OUT,blue,fontWeight=bold)>> <<BR>> [bytes/s] ||<v:rowspan=2> '''DISK read''' <<BR>> [bytes/s] ||<v:rowspan=2> '''DISK write''' <<BR>> [bytes/s] ||
|| Type || Arch || Clock || CPUs || Cores || Threads ||
||<v:rowspan=4> ilHadoop1 <<BR>> ilh01 <<BR>> | <<BR>> ilh40 || CentOS 6.5 || 2560 || Opteron 6320 || 64 bit || || 40 || 320 || 320 || 320 || CS || <<Ganglia(hadoop_cpu,ilHadoop,)>> || <<Ganglia(hadoop_mem,ilHadoop,)>> || <<Ganglia(hadoop_network,ilHadoop,)>> || <<Ganglia(hadoop_disk,ilHadoop,,read,sda*sdb*sdc*sdd)>> || <<Ganglia(hadoop_disk,ilHadoop,,write,sda*sdb*sdc*sdd)>> ||

Hadoop Cluster Status

Cluster status

2015-05-05 15:40:00 PDT

Cluster is online. Problem fixed.

2015-05-05 10:05:00 PDT

Cluster is online but jobs are not getting processed. Looking into the issue.

2015-04-30 11:40:00 PDT

Cluster is online. Upgraded to CDH 5.4.0 and JDK 8u45.

2015-04-29 16:30:00 PDT

Cluster is offline due to package upgrades (moving to CDH 5.4.0).

2015-04-09 06:42:00 PDT

Cluster is online.

2015-04-08 06:00:00 PDT

Cluster is offline due to NameNode issues. Investigating + trying to repair the problem.

2014-12-05 22:32:00 PST

Cluster is online.

2014-12-04 21:05:00 PST

Maintenance in progress. Installing the secondary namenode, Spark, etc.

2014-12-04 06:28:00 PST

Ganglia back to normal. Cluster is online.

2014-12-04 05:40:00 PST

Ganglia reporting is currently having issues.

2014-12-04 05:18:00 PST

Cluster is back online.

2014-12-03 19:15:00 PST

Cluster offline for system updates. Should be back online by 22:00 PST 2014-12-04 02:00 PST.

Cluster usage statistics

CPU load
[%]

RAM used
[bytes]

NETWORK traffic IN

OUT


[bytes/s]

DISK read
[bytes/s]

DISK write
[bytes/s]

References