Products
Zilliz Cloud
Fully-managed vector database service designed for speed, scale and high performance.
Zilliz Cloud vs. Milvus
Milvus
Open-source vector database built for billion-scale vector similarity search.
High-Performance Vector Database Made Serverless.
Pricing
Business Critical Plan
Developers
Documentation
The Zilliz Cloud Developer Hub where you can find all the information to work with Zilliz Cloud
Learn More
Join the Milvus Discord Community
Resources
Blog Guides Research Analyst Reports Webinars
Definitive Guide to Choosing a Vector Database
Customers
By Use CaseRetrieval Augmented Generation View all use cases View by industry View all customer stories
Filevine and Zilliz Cloud: Transforming Legal Case Management with Vector Search

Book a Demo Log in Get Started Free

Your AI Reference Guide
What is the discount factor in reinforcement learning?

What is the discount factor in reinforcement learning?

What is the discount factor in reinforcement learning?

The discount factor (denoted as 𝛾) in reinforcement learning (RL) is a value between 0 and 1 that determines the agent’s preference for immediate versus future rewards. A discount factor closer to 1 indicates that the agent values future rewards nearly as much as immediate rewards, while a discount factor closer to 0 means the agent prioritizes immediate rewards.

The discount factor is used to calculate the present value of future rewards in an agent's decision-making process. For example, if an agent receives a reward of 10 in the next state, and the discount factor is 0.9, the agent would treat that reward as worth 9 in the current state. This is important for tasks where long-term planning and delayed rewards are crucial.

In practice, the discount factor helps balance short-term and long-term goals. A lower discount factor might be useful in tasks where immediate results are more important, such as in a fast-paced game, while a higher discount factor is useful in tasks like investment planning, where future outcomes are more significant.

Recommended AI Learn Series

Exploring Vector Database Use Cases
Getting Started with Milvus
Natural Language Processing (NLP) Advanced Guide
Large Language Models (LLMs) 101
GenAI Ecosystem
All learn series →

VectorDB for GenAI Apps

Zilliz Cloud is a managed vector database perfect for building GenAI applications.

Try Zilliz Cloud for Free

Share this article

Keep Reading

What is anomaly detection in predictive analytics?

Anomaly detection in predictive analytics refers to the process of identifying data points, events, or observations that

What is zero-shot learning in image search?

Zero-shot learning in image search refers to the ability to recognize and classify images based on categories that the s

What is SaaS customer success management?

SaaS customer success management refers to the strategies and practices used by software-as-a-service (SaaS) companies t

AI Assistant