plexus.cli.tuning.operations module
- plexus.cli.tuning.operations.append_feedback_to_conversation(conversation_history, feedback_message)
Append feedback message to conversation history as a user message.
- Args:
conversation_history (list): Existing conversation messages feedback_message (str): Feedback message to append
- Returns:
list: Updated conversation history with feedback appended
- plexus.cli.tuning.operations.contains_disagreement(completion_text)
Check if the completion contains a disagreement with the gold standard.
- Args:
completion_text (str): The generated completion text
- Returns:
bool: True if the completion indicates disagreement
- plexus.cli.tuning.operations.create_hallucination_feedback(verification_result, original_transcript)
Create a feedback message for the LLM about hallucinated quotes.
- Args:
verification_result (dict): Result from verify_quotes_in_completion original_transcript (str): The original transcript text
- Returns:
str: Feedback message for the LLM
- plexus.cli.tuning.operations.create_no_quotes_feedback()
Create a feedback message instructing the LLM to generate a completion without any quotes. This is used as a last-ditch effort to save examples that repeatedly fail quote verification.
- Returns:
str: Feedback message instructing no quotes usage
- plexus.cli.tuning.operations.generate_llm_completion(score_instance, row, completion_template, conversation_history=None)
Generate completion using an LLM with full context and gold standard labels.
- Args:
score_instance: The score instance containing configuration row: The data row containing transcript and other information completion_template: The Jinja2 template with label interpolation for the LLM conversation_history: Optional list of previous messages for iterative improvement
- Returns:
tuple: (completion_text, updated_conversation_history)
- plexus.cli.tuning.operations.get_file_path(output_dir, file_type)
- plexus.cli.tuning.operations.get_id_file_path(output_dir, file_type)
- plexus.cli.tuning.operations.get_output_dir(scorecard_name, score_name, subsampled=False, max_tokens=None)
- plexus.cli.tuning.operations.verify_quotes_in_completion(completion_text, original_transcript, fuzzy_threshold=0.8, enable_fuzzy=True, debug=False)
Verify that all quotes in the completion exist in the original transcript.
- Args:
completion_text (str): The generated completion text original_transcript (str): The original transcript text fuzzy_threshold (float): Threshold for fuzzy matching (0.0-1.0) enable_fuzzy (bool): Whether to enable fuzzy matching debug (bool): Whether to enable detailed debug logging
- Returns:
- dict: Contains verification results with keys:
verified_quotes: list of quotes found in transcript
hallucinated_quotes: list of quotes NOT found in transcript
fuzzy_matched_quotes: list of quotes matched with fuzzy logic
is_valid: boolean indicating if all quotes are valid
verification_details: detailed information about each quote