Comparing the Quality of Patient Guidance on Dermatologic Care Generated by ChatGPT vs Reddit

To the Editor:

Online resources that are convenient and affordable play a crucial role in mitigating health inequality and improving patient access to health care information; however, the benefits are limited by the quality of information available, as medical misinformation can lead to patients engaging in harmful practices, making dangerous decisions, and even avoiding safe and effective treatments. In this study, we aimed to assess and compare the quality of patient guidance on dermatologic care generated by ChatGPT vs Reddit based on accuracy, appropriateness, and safety. It is essential to assess the quality and reliability of online health information to support patients in making informed decisions about their health.

The emergence and advancement of artificial intelligence and large language models such as ChatGPT present a new method for patients to access health care advice. ChatGPT can engage in conversation by accessing information from existing publicly available data on the internet, including books and websites, up to the year 2023 and providing humanlike responses with context.1 ChatGPT’s access to a breadth of online evidence-based literature ensures the dissemination of quality information that is quick and without inherent bias, offering the potential to more closely align with health care professionals. ChatGPT’s use in dermatology by patients has shown efficacy, with a 98.87% approval rate by dermatologists scoring its ability to recommend appropriate medication for common dermatologic conditions.2 However, ChatGPT has limitations when providing health care advice and has been observed to misunderstand health care standards, lack personalization, and offer incorrect references; currently, the latest publicly available version (ChatGPT 3.5) also is unable to analyze clinical images.3,4

Reddit is an online social media forum that allows users to post questions and photographs to which anyone can reply and offer advice. Patients may find comfort in online communities where they can connect with others facing similar challenges related to their diagnosis. Within these communities, the responses often share users’ own lived experiences and offer support based on what has and has not worked for them. Prior research found that users intentionally seeking health information via Reddit are likely to implement the advice they receive even without verification of its credibility, suggesting trust in and receptivity to ideas offered on the platform.5 Furthermore, a study analyzing the dermatologic content of 17 dermatology-related subreddits that had 1000 or more subscribers found that 70.6% of posts fell under the category of “seeking health/cosmetic advice.”6 Reddit users thus are vulnerable to receiving advice based on personal bias and exposing their health information to the public.

We hypothesized that ChatGPT would provide users with guidance that was more closely aligned with typical dermatologists’ advice due to its thorough analysis and compilation of diverse sources and recommendations available on the internet. We expected Reddit to yield recommendations of lesser quality and a diminished safety score, primarily due to the absence of credibility-vetting mechanisms and the influence of personal biases within the advice shared.

User-submitted posts to large dermatologic community Reddit forums representing a few of the most common skin conditions (r/eczema, r/acne, r/Folliculitis, r/SebDerm, r/Hidradenitis, r/keratosis, and r/Psoriasis) were retrospectively reviewed from January 2024 to March 2024. The most popular posts that did not include photographs were included in our study. Posts with photographs were excluded, as clinical images were not able to be uploaded to the publicly available ChatGPT 3.5. We collected real user questions about common skin conditions from Reddit forums and then asked ChatGPT to answer those same questions. We compared ChatGPT’s responses to the most upvoted Reddit comments to see how they matched up (eTable).

Each ChatGPT response and the top-rated Reddit comment were independently evaluated by a board-certified dermatologist (S.A.) and a dermatology resident (A.H.K.). The quality of the ChatGPT and Reddit responses was determined by scoring the accuracy, appropriateness, safety consideration, and specificity on a 5-point Likert scale (1=low, 5=high). The 2 evaluators’ mean scores for each of the 4 categories were calculated based on adequate interrater reliability, which was tested using Cohen’s κ coefficient. Related-samples sign tests were used to compare ChatGPT and Reddit responses for each of the 4 categories. Analysis was completed using SPSS statistics software version 29.0 (IBM). The evaluators also were asked to provide qualitative feedback on the strengths and weaknesses of each response.
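The related-samples sign test named above can be illustrated with a short Python sketch. It is not the analysis code used in this study, which was run in SPSS. The function name `sign_test` and the paired scores are hypothetical, and the exact two-sided binomial form with tied pairs dropped is an assumption made for illustration.

```python
from math import comb


def sign_test(a, b):
    """Exact two-sided sign test for paired scores; tied pairs are dropped."""
    diffs = [x - y for x, y in zip(a, b) if x != y]
    n = len(diffs)
    if n == 0:
        return 1.0  # every pair tied: no evidence of a difference
    positives = sum(d > 0 for d in diffs)
    k = min(positives, n - positives)
    # Probability of k or fewer of the rarer sign under a fair coin
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


# Hypothetical mean evaluator scores for 6 questions (not study data)
chatgpt = [4.5, 5.0, 4.0, 5.0, 4.5, 3.0]
reddit = [2.5, 3.0, 4.0, 2.0, 3.5, 3.5]
print(sign_test(chatgpt, reddit))  # 0.375
```

In this made-up example, one pair is tied and 4 of the remaining 5 favor ChatGPT, which is not enough for significance with so few questions. The test uses only the direction of each paired difference, which suits ordinal Likert ratings.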

Our retrospective review yielded 20 total questions: 5 (25%) on atopic dermatitis, 4 (20%) on acne, 4 (20%) on hidradenitis suppurativa, 4 (20%) on psoriasis, 1 (5%) on folliculitis, 1 (5%) on keratosis pilaris, and 1 (5%) on seborrheic dermatitis. The number of posts was limited to 20 due to the extensive time required for grading each response. These 20 questions were selected from a larger pool of eligible posts based on factors such as clarity and relevance to common skin conditions. With regard to the types of questions that were asked, 6 (30%) were related to general management of a diagnosis, 5 (25%) were on treatment recommendations for symptom relief, 3 (15%) were on optimal utilization of current treatment regimens, 2 (10%) were on prescription side effects, 2 (10%) were on diagnosis presentation, 1 (5%) was on potential triggers of the diagnosis, and 1 (5%) was on natural treatment recommendations.

Mean (SD) evaluator scores for accuracy were significantly higher among ChatGPT responses compared with Reddit (4.63 [0.60] vs 2.60 [0.98])(P<.001). ChatGPT responses also were significantly higher for appropriateness compared with Reddit (4.55 [0.71] vs 2.58 [1.02])(P<.001) and safety consideration (4.88 [0.56] vs 2.80 [0.97])(P<.001). There was no significant difference in mean specificity scores between ChatGPT and Reddit (4.25 [1.02] vs 3.80 [0.70])(P=.096)(Figure).

FIGURE. Average ratings from 2 evaluators of Reddit and ChatGPT responses to 20 dermatology-related questions for accuracy, appropriateness, specificity, and safety.

For the Reddit responses, the weighted Cohen’s κ coefficient between the 2 evaluators was 0.200 (95% CI, –.089 to .489) for accuracy, 0.255 (95% CI, .014-.497) for appropriateness, 0.385 (95% CI, .176-.594) for safety consideration, and –0.024 (95% CI, –.177 to .129) for specificity. For the ChatGPT responses, the weighted Cohen’s κ coefficient between the 2 evaluators was 0.426 (95% CI, .122-.730) for accuracy, 0.571 (95% CI, .294-.849) for appropriateness, 0.655 (95% CI, .632-.678) for safety consideration, and 0.313 (95% CI, .043-.584) for specificity.
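For readers unfamiliar with the statistic, the following Python sketch shows one way a weighted Cohen’s κ can be computed for 2 raters on a 5-point scale. It is illustrative only: the study values came from SPSS, the function name `weighted_kappa` is hypothetical, and linear disagreement weights are assumed because the weighting scheme is not stated in the text (setting `power=2` gives quadratic weights).

```python
def weighted_kappa(r1, r2, categories=(1, 2, 3, 4, 5), power=1):
    """Weighted Cohen's kappa for two raters (power=1 linear, 2 quadratic)."""
    k = len(categories)
    index = {c: i for i, c in enumerate(categories)}
    n = len(r1)
    # Observed joint proportions of (rater 1, rater 2) ratings
    obs = [[0.0] * k for _ in range(k)]
    for a, b in zip(r1, r2):
        obs[index[a]][index[b]] += 1 / n
    row = [sum(obs[i]) for i in range(k)]
    col = [sum(obs[i][j] for i in range(k)) for j in range(k)]
    observed = expected = 0.0
    for i in range(k):
        for j in range(k):
            w = (abs(i - j) / (k - 1)) ** power  # disagreement weight
            observed += w * obs[i][j]
            expected += w * row[i] * col[j]  # chance-expected disagreement
    if expected == 0:
        return float("nan")  # undefined when both raters use one category
    return 1.0 - observed / expected


# Hypothetical ratings (not study data)
print(weighted_kappa([1, 2, 3, 4, 5], [1, 2, 3, 4, 5]))  # 1.0
```

A value of 1 indicates perfect agreement, 0 indicates agreement no better than chance, and negative values indicate agreement worse than chance.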

The strengths and weaknesses of the responses also were qualitatively analyzed. One commonly observed strength was ChatGPT’s frequent and appropriate recommendation for users to consult a dermatologist. In the case of atopic dermatitis—one of the more frequently asked about conditions—ChatGPT consistently emphasized evidence-based strategies such as gentle skin care and moisturization, reflecting alignment with clinical guidelines. Additionally, a common weakness of both ChatGPT and Reddit responses generally was the lack of personalized guidance and comprehensive discussion of the risks and benefits of specific treatments. It also was noted that neither platform consistently explored differential diagnoses—for example, distinguishing atopic dermatitis from conditions such as allergic contact dermatitis—limiting the diagnostic depth of the responses.

ChatGPT and Reddit can provide patients with quick and accessible health information for various dermatologic concerns. The results of our study demonstrated a significantly higher level of accuracy, appropriateness, and safety of responses generated by ChatGPT compared with human-generated responses on Reddit (P<.001). Both platforms offered similarly specific responses to user inquiries, demonstrating ChatGPT’s ability to comprehend user questions and draw from publicly available texts and Reddit users’ contributing insights based on their own first-hand experiences.

Reddit’s dermatologic forums often feature personal anecdotes and unique treatments described by individual users. Although specific to particular dermatologic concerns, such advice lacks an evidence-based standard of care. With the noted inherent trust of patients seeking guidance within Reddit communities, patients may follow unhelpful or potentially dangerous medical advice.5 A study examining 300 user-submitted posts on popular Reddit dermatology forums during the COVID-19 pandemic found that the mean score for top-rated comments’ potential to be misleading or dangerous was 2.33 out of 5 on a Likert scale (95% CI, 2.18-2.48).7 Dermatologists should be aware of the potential risks associated with dermatologic advice offered on Reddit and should caution patients against relying solely on this information without consulting a qualified dermatologist first.

Reddit’s open-forum design provides licensed dermatologists with the opportunity to disseminate evidence-based information regarding dermatologic conditions. Currently, there is a subreddit (r/AskDocs) that allows users to post medical questions that can be answered by moderator-verified physicians. Participation from dermatologists in online communities such as this can improve the quality of dermatologic information shared online, combat misinformation, and promote safe skin care practices.

ChatGPT offers more accurate, appropriate, and safe information compared with Reddit responses, but its answers lack personalization. In a clinical setting, a personalized treatment plan from a physician can be tailored with a comprehensive discussion of the risks and benefits. Further, clinical settings allow for diagnosis and confirmation via biopsy and meticulous history taking to ensure that the diagnosis and treatment plan are accurate. While ChatGPT may be an option for seeking basic advice on dermatologic conditions, a licensed dermatologist should always be consulted for proper medical advice. Services such as telehealth may be another option for patients with limited access to care.

Since ChatGPT 3.5 does not support the ability to upload images, our study acknowledges a limitation regarding the inclusion of Reddit posts containing photographs. Images can improve the response quality from both Reddit users and ChatGPT. While the updated ChatGPT 4o is capable of processing images, it requires a monthly subscription fee. The free version was chosen for use in this study, as this may reflect the most likely version that patients of low socioeconomic status would utilize to access dermatologic care; however, there is potential for growth and improvement of ChatGPT’s capability in providing medical advice.

This study compared the strengths and limitations of ChatGPT’s and Reddit’s responses to common dermatologic inquiries. ChatGPT and Reddit both show potential to be helpful sources of dermatologic health information; however, their current versions have many limitations, and patients should examine the guidance provided with caution. Clinicians should be aware of these limitations when advising patients and emphasize the importance of consulting a licensed dermatologist for personalized, evidence-based care.

References
  1. Roumeliotis KI, Tselikas ND. ChatGPT and open-AI models: a preliminary review. Future Internet. 2023;15:192. doi:10.3390/fi15060192
  2. Iqbal U, Lee LTJ, Rahmanti AR, et al. Can large language models provide secondary reliable opinion on treatment options for dermatological diseases? J Am Med Inform Assoc. 2024;31:1341-1347. doi:10.1093/jamia/ocae067
  3. Whiles BB, Bird VG, Canales BK, et al. Caution! AI bot has entered the patient chat: ChatGPT has limitations in providing accurate urologic healthcare advice. Urology. 2023;180:278-284. doi:10.1016/j.urology.2023.07.010
  4. Nastasi AJ, Courtright KR, Halpern SD, et al. A vignette-based evaluation of ChatGPT’s ability to provide appropriate and equitable medical advice across care contexts. Sci Rep. 2023;13:17885. doi:10.1038/s41598-023-45223-y
  5. Record RA, Silberman WR, Santiago JE, et al. I sought it, I Reddit: examining health information engagement behaviors among Reddit users. J Health Commun. 2018;23:470-476. doi:10.1080/10810730.2018.1465493
  6. Buntinx-Krieg T, Caravaglio J, Domozych R, et al. Dermatology on Reddit: elucidating trends in dermatologic communications on the world wide web. Dermatol Online J. 2017;23:13030/qt9dr1f7x6.
  7. Aboul-Fettouh N, Lee KP, Kash N, et al. Social media and dermatology during the COVID-19 pandemic: analyzing user-submitted posts seeking dermatologic advice on Reddit. Cureus. 2023;15:E33720. doi:10.7759/cureus.33720
Author and Disclosure Information

From the Morsani College of Medicine, University of South Florida, Tampa. Emily Coughlin is from the Department of Medical Education and Drs. Lipman, Kucharik, and Albers are from the Department of Dermatology and Cutaneous Surgery.

The authors have no relevant financial disclosure to report.

Correspondence: Shaliz Aflatooni, BS, USF Health Morsani College of Medicine, 560 Channelside Dr, Tampa, FL 33602.

Cutis. 2025 June;115(6):197-199, E3. doi:10.12788/cutis.1222

To the Editor:

Online resources that are convenient and affordable play a crucial role in mitigating health inequality and improving patient access to health care information; however, the benefits are limited by the quality of information available, as medical misinformation can lead to patients engaging in harmful practices, making dangerous decisions, and even avoiding safe and effective treatments. In this study, we aimed to assess and compare the quality of patient guidance on dermatologic care generated by ChatGPT vs Reddit based on accuracy, appropriateness, and safety. It is essential to assess the quality and reliability of online health information to support patients in making informed decisions about their health.

The emergence and advancement of artificial intelligence and large language models such as ChatGPT present a new method for patients to access health care advice. ChatGPT can engage in conversation by accessing information from existing publicly available data on the internet, including books and websites, up to the year 2023 and providing humanlike responses with context.1 ChatGPT’s access to a breadth of online evidence-based literature ensures the dissemination of quality information that is quick and without inherent bias, offering the potential to more closely align with health care professionals. ChatGPT’s use in dermatology by patients has shown efficacy, with a 98.87% approval rate by dermatologists scoring its ability to recommend appropriate medication for common dermatologic conditions.2 However, ChatGPT has limitations when providing health care advice and has been observed to misunderstand health care standards, lack personalization, and offer incorrect references; currently, the latest publicly available version (ChatGPT 3.5) also is unable to analyze clinical images.3,4

Reddit is an online social media forum that allows users to post questions and photographs to which anyone can reply and offer advice. Patients may find comfort in online communities where they can connect with others facing similar challenges related to their diagnosis. Within these communities, the responses often share users’ own lived experiences and offer support based on what has and has not worked for them. Prior research found that users intentionally seeking health information via Reddit are likely to implement the advice they receive even without verification of its credibility, suggesting a trust and receptibility to ideas offered on the platform.5 Furthermore, a study analyzing the dermatologic content of 17 dermatology related subreddits that had 1000 or more subscribers found that 70.6% of posts fell under the category of “seeking health/cosmetic advice.”6 Reddit users thus are vulnerable to receiving advice based on personal bias and exposing their health information to the public.

We hypothesized that ChatGPT would provide users with guidance that was more closely aligned with typical dermatologists’ advice due to its thorough analysis and compilation of diverse sources and recommendations available on the internet. We expected Reddit to yield recommendations of lesser quality and a diminished safety score, primarily due to the absence of credibility-vetting mechanisms and the influence of personal biases within the advice shared.

User-submitted posts to large dermatologic community Reddit forums representing a few of the most common skin conditions (r/eczema, r/acne, r/Folliculitis, r/SebDerm, r/Hidradenitis, r/keratosis, and r/Psoriasis) were retrospectively reviewed from January 2024 to March 2024. The most popular posts that did not include photographs were included in our study. Posts with photographs were excluded, as clinical images were not able to be uploaded to the publicly available ChatGPT 3.5. We collected real user questions about common skin conditions from Reddit forums and then asked ChatGPT to answer those same questions. We compared ChatGPT’s responses to the most upvoted Reddit comments to see how they matched up (eTable).

CT115006197-eTable

Each ChatGPT response and the top-rated Reddit comment were independently evaluated by a board certified dermatologist (S.A.) and a dermatology resident (A.H.K.). The quality of the ChatGPT and Reddit responses were determined by scoring the accuracy, appropriateness, safety consideration, and specificity on a 5-point Likert scale (1=low, 5=high). The 2 evaluators’ mean scores for each of the 4 categories were calculated based on adequate interrater reliability, which was tested using Cohen’s κ coefficient. Related-samples sign tests were used to compare ChatGPT and Reddit responses for each of the 4 categories. Analysis was completed using SPSS statistics software version 29.0 (IBM). The evaluators also were asked to provide qualitative feedback on the strengths and weaknesses of each response.

Our retrospective review yielded 20 total questions: 5 (25%) on atopic dermatitis, 4 (20%) on acne, 4 (20%) on hidradenitis suppurativa, 4 (20%) on psoriasis, 1 (5%) on folliculitis, 1 (5%) on keratosis pilaris, and 1 (5%) on seborrheic dermatitis. The number of posts was limited to 20 due to the extensive time required for grading each response. These 20 questions were selected from a larger pool of eligible posts based on factors such as clarity and relevance to common skin conditions. With regard to the types of questions that were asked, 6 (30%) were related to general management of a diagnosis, 5 (25%) were on treatment recommendations for symptom relief, 3 (15%) were on optimal utilization of current treatment regimens, 2 (10%) were on prescription side effects, 2 (10%) were on diagnosis presentation, 1 (5%) was on potential triggers of the diagnosis, and 1 (5%) was on natural treatment recommendations.

Mean (SD) evaluator scores for accuracy were significantly higher among ChatGPT responses compared with Reddit (4.63 [0.60] vs 2.60 [0.98])(P<.001). ChatGPT responses also were significantly higher compared with Reddit for appropriateness (4.55 [0.71] vs 2.58 [1.02])(P<.001) and safety consideration (4.88 [0.56] vs 2.80 [0.97])(P<.001). There was no significant difference in mean specificity scores between ChatGPT and Reddit (4.25 [1.02] vs 3.80 [0.70])(P=.096)(Figure).

FIGURE. Average ratings from 2 evaluators of Reddit and ChatGPT responses to 20 dermatology-related questions for accuracy, appropriateness, specificity, and safety.

For the Reddit responses, the weighted Cohen’s κ coefficient between the 2 evaluators was 0.200 (95% CI, –.089 to .489) for accuracy, 0.255 (95% CI, .014-.497) for appropriateness, 0.385 (95% CI, .176-.594) for safety consideration, and –0.024 (95% CI, –.177 to .129) for specificity. For the ChatGPT responses, the weighted Cohen’s κ coefficient between the 2 evaluators was 0.426 (95% CI, .122-.730) for accuracy, 0.571 (95% CI, .294-.849) for appropriateness, 0.655 (95% CI, .632-.678) for safety consideration, and 0.313 (95% CI, .043-.584) for specificity.
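As an illustration of how a weighted Cohen’s κ is computed from 2 raters’ ordinal scores, a minimal Python sketch follows. The weighting scheme used in this study is not stated, so linear disagreement weights are assumed here; the function name `weighted_kappa`, its `power` parameter, and the example ratings are hypothetical and are not study data.

```python
def weighted_kappa(rater1, rater2, categories=(1, 2, 3, 4, 5), power=1):
    """Weighted Cohen's kappa for two raters on an ordinal scale.

    power=1 gives linear weights (assumed here); power=2 gives quadratic weights.
    Returns NaN when expected disagreement is zero (kappa is undefined).
    """
    k = len(categories)
    index = {c: i for i, c in enumerate(categories)}
    n = len(rater1)

    # Observed joint proportions of (rater1, rater2) category pairs.
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(rater1, rater2):
        observed[index[a]][index[b]] += 1 / n

    # Marginal proportions for each rater.
    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    weighted_obs = 0.0  # observed weighted disagreement
    weighted_exp = 0.0  # weighted disagreement expected by chance
    for i in range(k):
        for j in range(k):
            weight = (abs(i - j) / (k - 1)) ** power
            weighted_obs += weight * observed[i][j]
            weighted_exp += weight * row[i] * col[j]

    if weighted_exp == 0:
        return float("nan")
    return 1 - weighted_obs / weighted_exp


# Hypothetical ratings from 2 evaluators on a 5-point scale (NOT study data).
evaluator_a = [5, 4, 5, 3, 4, 5, 2, 4]
evaluator_b = [5, 5, 4, 3, 4, 4, 3, 4]
print(f"Linear weighted kappa: {weighted_kappa(evaluator_a, evaluator_b):.3f}")
```

A value of 1 indicates perfect agreement, 0 indicates agreement no better than chance, and negative values indicate agreement worse than chance, as with the specificity scores for the Reddit responses.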

The strengths and weaknesses of the responses also were qualitatively analyzed. One commonly observed strength was ChatGPT’s frequent and appropriate recommendation for users to consult a dermatologist. In the case of atopic dermatitis, one of the more frequently asked about conditions, ChatGPT consistently emphasized evidence-based strategies such as gentle skin care and moisturization, reflecting alignment with clinical guidelines. A common weakness of both ChatGPT and Reddit responses was the lack of personalized guidance and comprehensive discussion of the risks and benefits of specific treatments. It also was noted that neither platform consistently explored differential diagnoses (for example, distinguishing atopic dermatitis from conditions such as allergic contact dermatitis), limiting the diagnostic depth of the responses.

ChatGPT and Reddit can provide patients with quick and accessible health information for various dermatologic concerns. The results of our study demonstrated a significantly higher level of accuracy, appropriateness, and safety of responses generated by ChatGPT compared with human-generated responses on Reddit (P<.001). Both platforms offered similarly specific responses to user inquiries, reflecting ChatGPT’s ability to comprehend user questions and draw from publicly available texts as well as Reddit users’ ability to contribute insights based on their own first-hand experiences.

Reddit’s dermatologic forums often feature personal anecdotes and unique treatments described by individual users. Although specific to particular dermatologic concerns, such advice lacks an evidence-based standard of care. Given the noted inherent trust of patients seeking guidance within Reddit communities, patients may follow unhelpful or potentially dangerous medical advice.5 A study examining 300 user-submitted posts on popular Reddit dermatology forums during the COVID-19 pandemic found that the mean score for top-rated comments’ potential to be misleading or dangerous was 2.33 out of 5 on a Likert scale (95% CI, 2.18-2.48).7 Dermatologists should be aware of the potential risks associated with dermatologic advice offered on Reddit and should caution patients against relying solely on this information without consulting a qualified dermatologist first.

Reddit’s open-forum design provides licensed dermatologists with the opportunity to disseminate evidence-based information regarding dermatologic conditions. Currently, there is a subreddit (r/AskDocs) that allows users to post medical questions that can be answered by moderator-verified physicians. Participation from dermatologists in online communities such as this can improve the quality of dermatologic information shared online, combat misinformation, and promote safe skin care practices.

ChatGPT offers more accurate, appropriate, and safe information compared with Reddit responses, but its answers lack personalization. In a clinical setting, a personalized treatment plan from a physician can be tailored with a comprehensive discussion of the risks and benefits. Further, clinical settings allow for diagnosis and confirmation via biopsy and meticulous history taking to ensure that the diagnosis and treatment plan are accurate. While ChatGPT may be an option for seeking basic advice on dermatologic conditions, a licensed dermatologist should always be consulted for proper medical advice. Services such as telehealth may be another option for patients with limited access to care.

Because ChatGPT 3.5 does not support image uploads, the exclusion of Reddit posts containing photographs is a limitation of our study. Images can improve the response quality from both Reddit users and ChatGPT. While the updated ChatGPT 4o is capable of processing images, it requires a monthly subscription fee. The free version was chosen for use in this study, as this may reflect the most likely version that patients of low socioeconomic status would utilize to access dermatologic care; however, there is potential for growth and improvement of ChatGPT’s capability in providing medical advice.

This study compared the strengths and limitations of ChatGPT’s and Reddit’s responses to common dermatologic inquiries. ChatGPT and Reddit both show potential to be helpful sources of dermatologic health information; however, their current versions have many limitations, and patients should examine the guidance provided with caution. Clinicians should be aware of these limitations when advising patients and emphasize the importance of consulting a licensed dermatologist for personalized, evidence-based care.

References
  1. Roumeliotis KI, Tselikas ND. ChatGPT and open-AI models: a preliminary review. Future Internet. 2023;15:192. doi:10.3390/fi15060192
  2. Iqbal U, Lee LTJ, Rahmanti AR, et al. Can large language models provide secondary reliable opinion on treatment options for dermatological diseases? J Am Med Inform Assoc. 2024;31:1341-1347. doi:10.1093/jamia/ocae067
  3. Whiles BB, Bird VG, Canales BK, et al. Caution! AI bot has entered the patient chat: ChatGPT has limitations in providing accurate urologic healthcare advice. Urology. 2023;180:278-284. doi:10.1016/j.urology.2023.07.010
  4. Nastasi AJ, Courtright KR, Halpern SD, et al. A vignette-based evaluation of ChatGPT’s ability to provide appropriate and equitable medical advice across care contexts. Sci Rep. 2023;13:17885. doi:10.1038/s41598-023-45223-y
  5. Record RA, Silberman WR, Santiago JE, et al. I sought it, I Reddit: examining health information engagement behaviors among Reddit users. J Health Commun. 2018;23:470-476. doi:10.1080/10810730.2018.1465493
  6. Buntinx-Krieg T, Caravaglio J, Domozych R, et al. Dermatology on Reddit: elucidating trends in dermatologic communications on the world wide web. Dermatol Online J. 2017;23:13030/qt9dr1f7x6.
  7. Aboul-Fettouh N, Lee KP, Kash N, et al. Social media and dermatology during the COVID-19 pandemic: analyzing user-submitted posts seeking dermatologic advice on Reddit. Cureus. 2023;15:E33720. doi:10.7759/cureus.33720
Issue
Cutis - 115(6)
Page Number
197-199

PRACTICE POINTS

  • ChatGPT and Reddit are free, convenient, and accessible online resources that patients may use for guidance on dermatologic care.
  • Dermatologists should be aware of the potential risks associated with obtaining medical guidance from ChatGPT and Reddit and caution patients on them.
  • An increasing presence of dermatologists on online public forums can increase the dissemination of reliable health care information.