Distributed system comes with challenges!



 

It is essential that you know the answers to these basic questions before going live and releasing your system. 


Is the Cloud infrastructure reliable and correctly configured for your solution?

Is the System solution stable? Does it perform well with peaks and normal loads?

Processing of big data, saving to Database and processing queues all work well when system exposed to Load for longer duration?

Find if your solution can scale-up and scale-down as expected?

How does the sub-systems perform under load and stress? Are they Reliable enough?

What is the minimum infrastructure requirement to support the expected traffic and messages big data?

How the system recovers if a component/sub-component malfunction or crashes? How much is the downtime in case of restart? How long does the system takes to come to full capacity?

How are the Queues and topics behave when normal loads exceeded? Batch reading of messages need fine tuning?

Is your database scaling up as designed? What about the CPU utilization and queries executions under continuous endurance test for couple of hours?

Are the database queries performing optimally? Any index recommendations ?

How many PODS do you need to keep system performing optimally. Are you aware of the cost factor behind running infrastructure to this limits?

How is the system performing in peak moments? Is the scaling scaling up correctly? Any bottlenecks or delays faced?

How is the scale down functioning? Is the cloud solution costing too much when no load?

What about the impact on third-party components and services? Auto service recovery and service dependencies?

Is there any loss of messages?

How does the system react to unhappy scenarios with corrupt data or delay in any sub-component processing? What about error handling?

How efficient is the authentication functionalities when system under stress?

How stable is the Broker and component integration.

How efficient is the queue processing when one component is down for 1 to n minutes? How long it takes to Recovery and come back to satisfactory performance? Or does it recover at all?

Comments

Popular posts from this blog

Top 5 API (Application Programming Interface) testing tools in the market?

Programming languages most in demand in the Netherlands?

Top 10 Meest Gevraagde IT-banen in Nederland in 2025