-
Notifications
You must be signed in to change notification settings - Fork 94
Open
Description
It seems that NewsRecommendation generates the same context vector for each arm.
Reading LinUCB(2011 Li et al.) paper I have the impression that the context vector should be different (at least at some extend depending the arms features) for each arm. I am mistaken? Or the the implementation is that just for convenience?
ts_tutorial/src/news_recommendation/env_recommendation.py
Lines 39 to 44 in 3e229f7
| context = [] | |
| context_vector = np.random.binomial(1,max(0,1/(self.dim-1)),self.dim) | |
| context_vector[0] = 1 | |
| for i in range(self.num_articles): | |
| context.append(context_vector) | |
| self.current_rewards[i] = 1/(1+np.exp(-self.thetas[i].dot(context_vector))) |
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels