Skip to article frontmatterSkip to article content

Bijeenkomst 5 (vervolg-1)

Co-intelligentie: medische diagnostiek

Artikelen:

Brodeur, P. G., Buckley, T. A., Kanjee, Z., Goh, E., Ling, E. B., Jain, P., Cabral, S., Abdulnour, R.-E., Haimovich, A., Freed, J. A., Olson, A., Morgan, D. J., Hom, J., Gallo, R., Horvitz, E., Manrai, A. K., & Rodman, A. (z.d.). Superhuman performance of a large language model on the reasoning tasks of a physician. Brodeur et al. (2024)

Goh, E., Gallo, R., Hom, J., Strong, E., Weng, Y., Kerman, H., Cool, J. A., Kanjee, Z., Parsons, A. S., Ahuja, N., Horvitz, E., Yang, D., Milstein, A., Olson, A. P. J., Rodman, A., & Chen, J. H. (2024). Large Language Model Influence on Diagnostic Reasoning: A Randomized Clinical Trial. JAMA Network Open, 7(10), e2440969. Goh et al. (2024)

Actuele ontwikkelingen

  • nov 2023: ChatGPT 4
  • mei 2024: ChatGPT 4o (multimodaal)
  • sep 2024: Advanced Voice mode
  • sep 2024: o1 preview (“reasoning”)
  • okt 2024: ChatGPT Canvas
  • nov 2024: ChatGPT Search
  • dec 2024: Advanced Voice mode - met video input
  • dec 2024: o1 model (met video input)

verwacht voorjaar 2025: o3 model (opvolger o1)

LLM Leaderboard. https://lmarena.ai/?leaderboard

Zie voor uitleg ELO-rating: Elo-rating

ChatGPT Canvas

zie: Canvas tutorial

References
  1. Brodeur, P. G., Buckley, T. A., Kanjee, Z., Goh, E., Ling, E. B., Jain, P., Cabral, S., Abdulnour, R.-E., Haimovich, A., Freed, J. A., Olson, A., Morgan, D. J., Hom, J., Gallo, R., Horvitz, E., Chen, J., Manrai, A. K., & Rodman, A. (2024). Superhuman performance of a large language model on the reasoning tasks of a physician. arXiv. 10.48550/ARXIV.2412.10849
  2. Goh, E., Gallo, R., Hom, J., Strong, E., Weng, Y., Kerman, H., Cool, J. A., Kanjee, Z., Parsons, A. S., Ahuja, N., Horvitz, E., Yang, D., Milstein, A., Olson, A. P. J., Rodman, A., & Chen, J. H. (2024). Large Language Model Influence on Diagnostic Reasoning: A Randomized Clinical Trial. JAMA Network Open, 7(10), e2440969. 10.1001/jamanetworkopen.2024.40969