cross-posted from: https://lemmy.intai.tech/post/41889

A member of the orca team is taking the orca data set and forking the project. This is billed as the “uncensored” data set. The orca team claims it is an earlier set with less refinement.