I was trying to use Mistral-7B as the reward model, but I kept getting this error:

"ValueError: Found modules on cpu/disk. Using Exllama or Exllamav2 backend requires all the modules to be on GPU. You can deactivate exllama backend by setting disable_exllama=True in the quantization config object."

All I did was change the model and tokenizer names in the "configs/ppo_flan_sentiments.yml" file, and change the `__init__` method of the `ZeroShotRewardModel` class to match my model.
Can someone tell me what I can do to resolve this error?


I am passing the reward model and config to trlx.train() in the main method, in a similar way to what is done in ppo_flan_sentiments.py.