Lightweight Safety Classification Using Pruned Language Models (arxiv.org)
gdiamos 2 days ago [-]
This is really easy to set up, and it is much more accurate than asking the LLM to predict True/False.

Just feed the outputs of an embedding API into logistic regression, e.g. from sklearn.

  import voyageai
  
  vo = voyageai.Client()
  # This will automatically use the environment variable VOYAGE_API_KEY.
  # Alternatively, you can use vo = voyageai.Client(api_key="<your secret key>")

  texts = ["Sample text 1", "Sample text 2"]

  result = vo.embed(texts, model="voyage-2", input_type="document")
  print(result.embeddings[0])

https://scikit-learn.org/1.5/modules/generated/sklearn.linea...
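
A minimal sketch of the logistic regression step described above, assuming the embeddings have already been fetched. The helper name `train_safety_classifier` and the synthetic Gaussian clusters are hypothetical stand-ins so the example runs without an API key; in practice `X` would hold the vectors from `result.embeddings`, and `y` would hold your own safe/unsafe labels.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression


def train_safety_classifier(embeddings, labels):
    """Fit logistic regression on precomputed embedding vectors (hypothetical helper)."""
    clf = LogisticRegression(max_iter=1000)
    clf.fit(np.asarray(embeddings), np.asarray(labels))
    return clf


# Stand-in data (assumption): two Gaussian clusters play the role of
# embedding vectors. Real vectors from the embedding API would replace these.
rng = np.random.default_rng(0)
safe = rng.normal(loc=-1.0, scale=0.5, size=(50, 16))
unsafe = rng.normal(loc=1.0, scale=0.5, size=(50, 16))
X = np.vstack([safe, unsafe])
y = np.array([0] * 50 + [1] * 50)  # 0 = safe, 1 = unsafe

clf = train_safety_classifier(X, y)

# Probability of the "unsafe" class for each input, instead of a hard True/False.
probs = clf.predict_proba(X)[:, 1]
train_accuracy = clf.score(X, y)
```

Thresholding `probs` rather than calling `predict` lets you trade false positives against false negatives. Accuracy should be measured on a held-out split, not on the training data as in this toy setup.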
bberenberg 2 days ago [-]
Are these models available for us to try out?
flatpepsi17 2 days ago [-]
I'd pay good money for a local LLM with no "content safety" at all.