The Singularity is here --> Self-Awareness of ChatGPT
Thats quite interesting:
https://chat.openai.com/chat
Q: Is ChatGPT based only on peer reviewed sources or
does it also consider other information.
A: ChatGPT is a machine learning model that is trained
on a dataset of text, which can include a variety of sources
such as peer-reviewed articles, websites, books, and more.
The specific sources used to train the model may vary
depending on the specific version of the model, but it is
likely that a mix of sources was used, not only peer-reviewed
articles.
The model itself is a variant of GPT(Generative Pre-training
Transformer) architecture based on Transformer that has
been pre-trained on a large corpus of text. It then can fine-tuned
to a specific task or Domain using labeled data.
Q: How is this mix defined. When are peer-reviewed
articles more weighted.
A: The specific mix of sources used to train a model like
ChatGPT would be defined by the creators of the model or
the organization that released it. It's likely that a diverse
range of text was used in order to make the model generalize
well to a variety of tasks and inputs.
Peer-reviewed articles may be given more weight in situations
where the model is being fine-tuned for a specific domain that
relies heavily on research and scholarly work. Since peer-reviewed
articles have been vetted by experts in the field, they are often
considered to be more reliable and credible sources of information.
Therefore, when a high level of accuracy and trustworthiness is
required, such as in scientific or medical domain, the model might
have been fine-tuned with a dataset that have more
peer-reviewed articles.
It's also worth noting that the training data is typically curated and
pre-processed before being used to train a model, and this can also
influence the mix of sources. For example, specific publications or
websites might be excluded, or certain types of text might be
oversampled or undersampled to balance the dataset.
LoL