So, I’ve heard that ML manipulates tokens and specifically for the English corpora they take place of words. If we want model to be polite and not to speak uncomfortable language we can remove certain words from the internal array where all tokens and their associative data are stored, for example “fuck”.

  • Kalash
    link
    fedilink
    79 months ago

    Just run the output through a simple string replacement function before returning it to the user. No need to mess with the model itself.

    • aardA
      link
      29 months ago

      A well proven clbuttic solution.