It was reported that a CH240 blade installed in one of the E9000 chassis recently purchased by the customer was showing the alarms "CPU1 Thermal Trip" and "Configuration Error".
These alarms point to a hardware issue, so I had to check the things that could be a possible cause, which include:
1: Check whether the fan modules are working fine and that no temperature or fan module failure alarm is being reported.
2: Check that the cooling facility in the datacenter is working fine.
3: Finally, check whether there is an issue with the hardware inside the blade.
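The order of these checks can be sketched as a small decision function. This is only an illustration of the reasoning, not a tool that talks to the chassis: the function name, its parameters, and the returned labels are all hypothetical, and the alarm lists are assumed to have been read manually from the management interface.

```python
def next_check(fan_alarms, temperature_alarms, datacenter_cooling_ok):
    """Return which area to investigate first for a CPU thermal trip.

    All inputs are hypothetical values gathered by hand:
    fan_alarms / temperature_alarms are lists of alarm strings seen on
    the chassis, datacenter_cooling_ok is True if room cooling is fine.
    """
    # Step 1: any fan or temperature alarm points at chassis cooling.
    if fan_alarms or temperature_alarms:
        return "fan modules"
    # Step 2: the chassis is fine, so rule out the room itself.
    if not datacenter_cooling_ok:
        return "datacenter cooling"
    # Step 3: nothing external is wrong, so look inside the blade.
    return "blade hardware"


# In this case no fan or temperature alarm was reported and the
# datacenter cooling was fine, which leaves the blade itself.
print(next_check([], [], True))
```

Working from the outside in like this avoids taking downtime on the blade until the checks that need no downtime have been ruled out.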
I had to take downtime from the customer and pull the server out. The issue was with CPU1, so I removed the heat sink from CPU1 and waited 5 to 8 minutes before installing it back.
The heat sink was a possible cause of this issue, so this was the first thing I had to check. After this step I placed the server back and powered it on to check whether the alarm had cleared.
Upon checking, I found that the alarm was cleared and the CPU was back in the normal state.