llama-3.2-11B-cot-separate-steps
val_100_ex.json
image → text
Groq
Llama 3.2 11B Vision (Preview)
caption
{imgname} I have an image and a question that I want you to answer. Caption the image in detail. Describe the contents of the image, specifically focusing on details relevant to the question. Question: {query} Caption:
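A minimal sketch of how the template above could be filled per row, assuming `{imgname}` and `{query}` are column names in val_100_ex.json and that substitution is plain string formatting. The row values and the `render_prompt` helper are hypothetical. The actual tool may attach the image itself in place of `{imgname}` rather than inserting text.

```python
# Prompt template copied from the run configuration above.
TEMPLATE = (
    "{imgname} I have an image and a question that I want you to answer. "
    "Caption the image in detail. Describe the contents of the image, "
    "specifically focusing on details relevant to the question. "
    "Question: {query} Caption:"
)


def render_prompt(row: dict) -> str:
    """Fill the template from one dataset row (column names are assumed)."""
    return TEMPLATE.format(imgname=row["imgname"], query=row["query"])


# Hypothetical row, for illustration only; not taken from the dataset.
example_row = {"imgname": "example.png", "query": "What is the highest bar?"}
prompt = render_prompt(example_row)
print(prompt)
```

The rendered text ends with "Caption:", so the model's completion is stored directly in the `caption` column.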
Dec 6, 2024, 5:07 PM UTC
5 row sample
5 rows processed, 1756 tokens used ($0.0003)
Estimated cost for all 100 rows: $0.0063
Sample Results (completed)
9 columns, 1-5 of 100 rows
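The estimated cost for all 100 rows can be reproduced by scaling the 5-row sample linearly. The sketch below assumes a flat rate of $0.18 per million tokens; that rate is not stated on this page and is chosen only because it is consistent with both displayed figures. The `estimate_full_cost` helper is hypothetical.

```python
def estimate_full_cost(sample_tokens: int, sample_rows: int,
                       total_rows: int, usd_per_million_tokens: float) -> float:
    """Extrapolate total cost from a sample, assuming similar tokens per row."""
    tokens_per_row = sample_tokens / sample_rows
    projected_tokens = tokens_per_row * total_rows
    return projected_tokens * usd_per_million_tokens / 1_000_000


ASSUMED_RATE = 0.18  # USD per million tokens; an assumption, not from the page

sample_cost = 1756 * ASSUMED_RATE / 1_000_000
full_cost = estimate_full_cost(1756, 5, 100, ASSUMED_RATE)

print(f"sample: ${sample_cost:.4f}, full run: ${full_cost:.4f}")
```

Multiplying the rounded sample cost ($0.0003) by 20 gives $0.0060, so the displayed $0.0063 is presumably computed from the unrounded figure.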