Skip to content

Configuration

QQL stores connection config in ~/.qql/config.json. It is written at connect time and read on every subsequent command.

ModeDense vectorsSparse vectorsUse when
cloudQdrant Cloud server-side inferenceQdrant CloudRunning on Qdrant Cloud
localLocal OpenAI-compatible APIRepo BM25 + Qdrant sparse IDFSelf-hosted Qdrant + local embeddings (LM Studio, Ollama, llamafile)
externalRemote OpenAI-compatible APISame as localRemote Qdrant + remote embedding service
FlagDefaultDescription
--urlQdrant gRPC URL. Must use port 6334, not 6333.
--secretQdrant API key (Qdrant Cloud)
--inference-modecloudcloud, local, or external
--embedding-endpointOpenAI-compatible /v1/embeddings URL (local/external mode)
--embedding-keyBearer token for hosted embedding providers (optional)
--embedding-modelModel name sent in the embeddings request
--embedding-dimensionautoVector dimension. Auto-probed if reachable.
Cloud connect
qql-go connect --url https://<cluster>.qdrant.io --secret <api-key>

Qdrant Cloud handles all vectorization server-side. No embedding endpoint needed.

Local connect (macOS / Linux)
qql-go connect
--url http://localhost:6334
--inference-mode local
--embedding-endpoint http://127.0.0.1:1234/v1/embeddings
--embedding-key <optional-api-key>
--embedding-model text-embedding-all-minilm-l6-v2-embedding
--embedding-dimension 384
Local connect (Windows)
qql-go connect --url http://localhost:6334 --inference-mode local --embedding-endpoint http://127.0.0.1:1234/v1/embeddings --embedding-model text-embedding-all-minilm-l6-v2-embedding

Compatible embedding servers: LM Studio, Ollama, llamafile, Cohere, OpenAI, or any server that implements the /v1/embeddings API.

External connect
qql-go connect
--url https://your-qdrant.example.com:6334
--secret <qdrant-api-key>
--inference-mode external
--embedding-endpoint https://api.openai.com/v1/embeddings
--embedding-key sk-...
--embedding-model text-embedding-3-small

The config file lives at ~/.qql/config.json. File permissions are 0o600 (contains secrets).

For local/external mode, BM25 parameters can be tuned in the config:

ParameterDefaultDescription
k11.5Term frequency saturation
b0.75Length normalization
avgdl100Average document length
Disconnect
qql-go disconnect

Removes the stored config. Subsequent commands will require reconnection.

Health check
qql-go doctor qql-go doctor --quiet --json
FeatureCloudLocalExternal
Dense insert/search
Hybrid insert/search
Sparse-only search
RERANK
Recommend