End-to-End AI Testing and Benchmarking: Backend, Frontend, Security