Pinned Loading
-
multi-rag-quorum
multi-rag-quorum PublicMulti RAG Quorum — adversarial multi-retriever system for grounded question answering
Python 1
-
lm-evaluation-harness
lm-evaluation-harness PublicForked from EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Python 1
-
promptfoo
promptfoo PublicForked from promptfoo/promptfoo
Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line …
TypeScript 1
If the problem persists, check the GitHub status page or contact support.