Reducing emergency response times is critical to improving the efficiency of integrated rescue systems (IRS) and mitigating the impact of crisis events. This study investigates the deployment of intelligent sound event detection (SED) systems that recognize specific sounds, such as gunshots and shouting, in public and commercial spaces. In controlled simulations in an airport administrative building, SED systems significantly outperformed traditional notification methods, reducing average response times by over 97%, from 175 seconds to 5 seconds. These findings highlight the potential of SED systems to revolutionize emergency response strategies. The study also introduces a novel approach that integrates sound detection with video surveillance into multimodal systems. This combination enhances situational awareness and allows more precise responses to emergencies, addressing limitations of standalone detection systems. The study acknowledges key limitations, primarily that SED systems are less effective in silent incidents. The results emphasize the scalability of SED systems to diverse real-world applications in critical locations such as public institutions, shopping centers, and transportation hubs, where rapid decision-making is essential. Future research should explore optimizing these systems for noisy and unpredictable environments and advancing machine learning algorithms to improve reliability, adaptability, and detection accuracy, ensuring robust crisis management in varied scenarios.