Vision-Language Foundation Models