All posts 5

GPT data augmentation

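# The Korean prompt below reads roughly: "You are a chatbot assistant that generates
# additional questions for an answer. You are given several questions separated by \n
# and one answer. Your task is to use them to generate 30 concise additional questions
# with the predicate reworded."
system_prompt = """당신은 답변에 대한 질문을 추가 생성하는 챗봇 어시스턴트 입니다. 당신에게는 \\n으로 구분된 여러개의 질문과 하나의 답변이 제공됩니다. 당신의 임무는 해당 질문과 답변을 참고하여 서술어를 바꾼 간결한 30개의 추가 질문을 생성하는 것입니다."""

from openai import OpenAI

client = OpenAI()

# Function that generates additional questions
def generate_additional_questions(question, answer):
    # "질문" means "question" and "답변" means "answer"
    prompt = f"질문: {question}\n답변: {answer}\n\n{system_prompt}"

    try:
        # Call the GPT API
        response = client.chat.compl..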

cookbook 2024.09.10
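The excerpt above is cut off before the API response is handled, so how the post processes the reply is unknown. As a minimal sketch, assuming the model returns one question per line (possibly numbered or bulleted), the reply could be cleaned and de-duplicated as follows. The helper name `parse_generated_questions` and its parameters are hypothetical and do not come from the post.

```python
import re


def parse_generated_questions(raw_text, existing_questions=()):
    """Split a model reply into clean, de-duplicated questions.

    Hypothetical helper: assumes one question per line, optionally
    prefixed with a list marker such as "1.", "2)", "-", or "*".
    """
    # Questions we already have should not be added again.
    seen = {q.strip() for q in existing_questions}
    questions = []
    for line in raw_text.splitlines():
        # Remove a leading list marker, then surrounding whitespace.
        cleaned = re.sub(r"^\s*(?:\d+[.)]|[-*•])\s*", "", line).strip()
        if cleaned and cleaned not in seen:
            seen.add(cleaned)
            questions.append(cleaned)
    return questions


if __name__ == "__main__":
    reply = "1. How do I get a refund?\n2) What is the refund process?\n- How do I get a refund?"
    print(parse_generated_questions(reply, existing_questions=["Can I get a refund?"]))
```

Filtering against the original questions keeps the augmented set from simply repeating its seed data, which matters when the goal is paraphrase variety.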

Ensemble retriever: BM25 (with morphological tokenization) + FAISS, noun extraction

from langchain_community.vectorstores import FAISS
from langchain.retrievers import BM25Retriever, EnsembleRetriever
from langchain.schema import Document
from konlpy.tag import Okt
from kiwipiepy import Kiwi

kiwi = Kiwi()

def kiwi_tokenize(text):
    return [token.form for token in kiwi.tokenize(text)]

kiwi_bm25 = BM25Retriever.from_texts(texts, preprocess_func=kiwi_tokenize)
faiss = FAISS.from_document..

cookbook 2024.09.09
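The excerpt above stops before the two retrievers are combined. To illustrate how an ensemble of a keyword retriever and a vector retriever can merge its results, here is a standalone sketch of weighted Reciprocal Rank Fusion, the fusion scheme that, to my understanding, LangChain's `EnsembleRetriever` is based on. This is not the library's code: the function name `weighted_rrf`, the use of plain strings as documents, and the constant `c=60` are assumptions made for illustration.

```python
from collections import defaultdict


def weighted_rrf(ranked_lists, weights, c=60):
    """Fuse several ranked result lists with weighted Reciprocal Rank Fusion.

    Each document gets weight / (rank + c) from every list it appears in;
    documents are returned in order of descending total score.
    """
    if len(ranked_lists) != len(weights):
        raise ValueError("ranked_lists and weights must have the same length")

    scores = defaultdict(float)
    for docs, weight in zip(ranked_lists, weights):
        # Ranks start at 1, so the top hit contributes weight / (1 + c).
        for rank, doc in enumerate(docs, start=1):
            scores[doc] += weight / (rank + c)

    return sorted(scores, key=scores.get, reverse=True)


if __name__ == "__main__":
    bm25_hits = ["a", "b", "c"]   # hypothetical keyword-retriever ranking
    faiss_hits = ["b", "d", "a"]  # hypothetical vector-retriever ranking
    print(weighted_rrf([bm25_hits, faiss_hits], weights=[0.5, 0.5]))
```

Because the fusion uses only ranks, the BM25 and FAISS scores never need to be put on a common scale; the weights alone control how much each retriever influences the final order.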