GPT-4 gains eyes — the model that made multimodal mainstream
GPT-4 extended with vision capabilities, accepting images alongside text as input