return to table of content
FastVLM: Efficient vision encoding for vision language models
72 comments