Incident of 10/20/2025

On October 20th, our services experienced significant degradation due to a major outage in the us-east-1 region of Amazon Web Services (AWS), which affected the availability of compute resources (EC2). During the event, AWS stopped provisioning new instances, preventing our infrastructure from scaling and maintaining normal operational capacity.
In short, our database remained operational, but the servers handling user requests began to saturate and could not increase capacity.
Although the root cause of the incident was entirely external to Sytex, the global impact of the AWS outage highlighted opportunities to strengthen our operational resilience. During the event, we deployed a contingency compute cluster in the us-east-2 (Ohio) region, which allowed us to restore service continuity. We are currently optimizing this process so that, in similar scenarios, failover occurs more quickly and with minimal downtime.
We are also implementing structural improvements to ensure we are prepared should a similar situation arise again. Actions already underway include:
  • Optimization of our disaster recovery process, achieving faster failover times.
  • Evaluation of a multi-region strategy to ensure high availability in the event of regional failures.
We apologize for any inconvenience caused and reaffirm our commitment to the reliability and operational stability of the platform.


Answers to questions raised by our users

Why don’t you have operational redundancy?

We do have operational redundancy.AWS provides  full redundancy across Availability Zones (AZ) .
Each Availability Zone includes redundant networking, power, and storage resources.Sytex’s infrastructure is deployed across multiple Availability Zones.The outage on October 20th exceeded this layer of protection.

Why don’t you have redundancy across regions?

Cross-region redundancy adds latency and operational costs that, until now, we considered unjustified given the level of security offered by multi-AZ deployments.Despite the rarity of such events, we are now evaluating a multi-region deployment of compute resources.

Why don’t you have multi-cloud redundancy?

Sytex operates with a transactional persistence model that makes multi-cloud operations highly complex.However, this remains our final line of defense.
In addition to storing backups in an  AWS air-gapped vault , we also replicate persistent data in another cloud provider to recover operational capacity in the event of a catastrophic incident.