Exploring the Performance and Robustness of Vision-Language Models for Image Quality Assessment