Distributed system comes with challenges!
It is essential that you know the answers to these basic questions before going live and releasing your system.
• Is the Cloud infrastructure reliable and correctly configured for your solution?
• Is the System solution stable? Does it perform well with peaks and normal loads?
• Processing of big data, saving to Database and processing queues all work well when system exposed to Load for longer duration?
• Find if your solution can scale-up and scale-down as expected?
• How does the sub-systems perform under load and stress? Are they Reliable enough?
• What is the minimum infrastructure requirement to support the expected traffic and messages big data?
• How the system recovers if a
component/sub-component malfunction or crashes? How much is the downtime in case
of restart? How long does the system takes to come to full capacity?
• How are the Queues and topics behave when
normal loads exceeded? Batch reading of messages need fine tuning?
• Is your database scaling up as designed? What about the CPU utilization and queries executions under continuous endurance test for couple of hours?
• Are the database queries performing
optimally? Any index recommendations ?
• How many PODS do you need to keep system
performing optimally. Are you aware of the cost factor behind running infrastructure to this limits?
• How is the system performing in peak moments? Is the scaling scaling up correctly? Any bottlenecks or delays faced?
• How is the scale down functioning? Is the cloud solution costing too much when no load?
• What about the impact on third-party components and services? Auto service recovery and service dependencies?
• Is there any loss of messages?
• How does the system react to unhappy scenarios with corrupt data or delay in any sub-component processing? What about error handling?
• How efficient is the authentication functionalities when system under stress?
• How stable is the Broker and component integration.
Comments
Post a Comment