In this part of the tutorial, we explore how to enhance the model’s understanding and responses by using a dataset filled with examples. Initially, we worked with a single complaint. Now, by introducing a broader range of examples, the model can better grasp variations and details, leading to more precise categorizations.
Datasets are crucial for complex tasks or further refining (fine-tuning) your model.
Go to the Datasets tab. This is where you can manage different datasets.
Select New Dataset.
In Dataset type, select the Hugging Face tab.
Fill in the details:
Dataset name: determined-ai/customers-complaints
Description: Customer complaints pre-processed and classified into issue categories
Config name: Leave this field blank. This dataset does not have multiple configurations, or sub-parts. See Hugging Face Configurations for more details.
Select Create Dataset.
Your dataset is now ready to be linked to your project!
Provide clear instructions to the model on how to use the data:
Instruction
You are a customer support expert in a financial sector. Your task is to classify customer complaints
as related to a particular product from the following list. Don't include any explanation.
Only respond with one product from the numbered list.
Categories:
1. Credit reporting, credit repair services, or other personal consumer reports
2. Credit card or prepaid card
3. Credit card
4. Student loan
5. Debt collection
6. Checking or savings account
7. Credit reporting or other personal consumer reports
8. Vehicle loan or lease
9. Payday loan, title loan, or personal loan
10. Mortgage
Examples
Below are examples:
Complaint: {{Consumer_complaint_narrative}}
Answer: {{Product}}
Review the generated output and check how accurately the complaints are categorized.
By providing examples and linking a dataset, we can see the new output contains the original linked dataset with new columns including a column where the model has categorized the complaint.
Provide Examples and Link a Dataset: By incorporating examples and connecting a dataset, you enabled the model to understand and process a broader range of data. This step is crucial for enhancing the model’s ability to categorize information accurately.