English

English

When Health Data Becomes a Conversation Partner: Perplexity’s New Integrations, Seen From the Therapy Room

In therapy, “health data” almost never arrives as a clean story and Perplexity’s latest health update leans right into that reality. The announcement centers on new integrations that let people bring together personal health information, organize it into dashboards, and use it to create clearer summaries and questions for medical visits. From the therapy room, that immediately raises a human question: what changes when a person can gather their health signals in one place and actually talk through them, instead of chasing them across apps? From a therapist’s point of view, the best-case impact is simple and practical: structure. Many clients struggle to summarize what’s happening in a way a clinician can use, when it started, what triggers it, what makes it better or worse, what’s been tried, what changed, and how it affects sleep, work, appetite, mood, and relationships. If a tool can help draft a pre-visit summary from the mess of real life, that can reduce cognitive load, reduce shame (“I can’t explain it well”), and help someone walk into an appointment with clearer questions and fewer omissions. But the inconveniences and challenges are real, and they’re not just about setup. The biggest one I see clinically is that a single dashboard can quietly become a “threat monitor.” For clients prone to health anxiety, panic, OCD-style reassurance seeking, trauma-related body scanning, or chronic stress, more tracking doesn’t always equal more clarity. It can increase checking, amplify normal fluctuations, and keep the nervous system on alert, especially when numbers feel like verdicts instead of context. Another challenge is false clarity. Wearables are noisy, labs are snapshots, and medical records can be incomplete or inconsistent. When an AI-generated summary sounds confident, it can pull people toward conclusions that aren’t actually supported sometimes in ways that increase catastrophizing, sometimes in ways that minimize something important. In therapy, I’m less worried about whether the tool is “smart,” and more worried about whether it can communicate uncertainty honestly, and whether the person using it can hold that uncertainty without spiraling. There’s also the basic friction of access and use. Integrating accounts, permissions, and records can be confusing, and the people who need the most support are often the least resourced to troubleshoot a complicated setup, especially when they’re already exhausted, in pain, or overwhelmed. If the tool becomes another task that they “fail” at, it can reinforce the very helplessness we’re trying to reduce. Privacy is the quieter challenge that shows up later in session. People don’t just upload “health data”, they upload fear, vulnerability, and context that crosses into mental health, relationships, substance use, sexual health, and trauma history. When someone is distressed, they tend to trade privacy for reassurance. Part of a therapist’s job is to slow that moment down: not to shame the choice, but to help the client make it with clear eyes. If I were to incorporate something like this into therapy, I’d treat it as a collaborative artifact, not an authority. Bring the summary in, and we do what therapy does best: slow it down, reality-check it, and translate it into next steps. What’s missing? What might be an overinterpretation? Is this helping you feel more agency or is it feeding compulsive monitoring? Used carefully, these tools can support better conversations with medical providers. Used carelessly, they can make the story feel more coherent while quietly increasing anxiety. The difference is rarely the technology alone; it’s the relationship the person forms with it.

English

AI “Safety” Isn’t the Same as Clinical Safety: What the Research Trend Means for Our Therapy Practice

A useful piece of information to keep in mind is this: many AI chatbots look “safe” in testing because they refuse obvious harmful requests, but they can still respond unsafely when the same intent is phrased indirectly. This is often described as keyword-based safety (catching flagged words) versus intent awareness (understanding what the person is actually trying to do). In other words, the model may pass safety checks by recognizing certain terms, yet fail when distress is expressed in more human, ambiguous language. What this means for our therapy practice is immediate: our clients rarely speak in clean, explicit “risk language.” They test the waters. They minimize. They speak in metaphors. They code-switch. They communicate through tone and omission. If a tool only “detects” risk when the client uses the right words, that tool mirrors the least helpful kind of assessment, one that rewards performance and misses lived experience. A second key reality: many models are trained to be warm, validating, and agreeable. That can feel supportive, but clinically we know validation without discernment can become reinforcement. As therapists, we validate emotion while gently challenging distortions, checking reality, and tracking function over time. An AI can unintentionally validate emotion, interpretation and impulsive plan all at once, because it’s optimized to be helpful and coherent, not to hold clinical responsibility. Then there’s AI bias, and in therapy we should assume it shows up in ways that matter. Models can respond differently based on dialect, second-language English, culture-shaped expressions of pain, or even how “organized” a story sounds. The client who is dysregulated, repetitive, or fragmented (often highest need) may get generic reassurance, while the client who is articulate and persuasive may get more detailed, confidence-sounding answers. That is not just unfair—it can skew risk, rapport, and decision-making. So practically, when a client tells us they’ve been using a chatbot, we don’t treat it as a quirky side detail anymore, we treat it like a new “third voice” in the system. We ask: When do you use it, before bed, after fights, during panic? What does it tend to say? Do you feel calmer, or more certain? Does it reduce shame, or does it keep you looping? That assessment gives us clinical data: the tool’s role (soothing, escalating, avoiding, rehearsing), and the client’s relationship with it (dependency, secrecy, relief, shame). In session, this information nudges us to be more explicit about the difference between emotional validation and clinical containment. We might say: “A chatbot can sound caring and still miss what we’re tracking, risk patterns, triggers, relapse signatures, coercion, dissociation, trauma responses.” This isn’t anti-tech; it’s psychoeducation. It helps clients understand why “it felt supportive” isn’t the same as “it was safe for my nervous system and my real-life consequences.” It also changes how we handle risk conversations. Because AI safety can be cue-based, we assume clients may have learned (without meaning to) that certain wording gets shut down and other wording gets rewarded. That can shape disclosure: clients may avoid direct language, or they may rehearse safer-sounding narratives. Practically, we make more room for graded disclosure: “If it’s hard to say plainly, can we circle it, what are the closest words you can tolerate right now?” That keeps the door open without forcing performance. On the provider side, it pushes us to tighten boundaries and documentation when AI touches our workflow. If we use AI for drafts (handouts, summaries, exercises), we treat it like an intern: we review every line, remove anything that sounds overconfident, and check for bias-laden assumptions (culture, gender roles, family expectations, “should” language). If an organization suggests AI note-writing, our clinical question becomes: where is the data going, who can access it, and what happens if the model invents details? Clinical responsibility doesn’t outsource. When we’re advising colleagues or a clinic, we translate all of this into simple evaluation questions: Does the tool stay safe over multiple turns, or does it drift into over-agreement? Does it respond appropriately to indirect distress? Does it treat different dialects and cultural expressions consistently? Does it have clear escalation behavior (crisis resources, “get human help”) without shaming? If a vendor can’t answer those plainly, we assume the tool is optimized for demos, not for therapy-adjacent reality. Finally, we treat AI bias as an equity issue inside care, not a tech footnote. We build it into supervision and training: we role-play indirect phrasing, different cultural idioms of distress, and coercive-relationship narratives to see how tools might misread them. And we tell clients something grounding: “Use it if it helps, but don’t let it become your judge, your diagnosis, or your safety plan.” In practice, that stance keeps us clinically responsible while acknowledging the world our clients already live in.

English

One Framework, Many Workflows: A Deep Dive on the White House AI Blueprint—and Where It Still Feels Thin

The White House’s national AI policy framework released on March 20, 2026 (now a week old as of March 29, 2026) is best understood as a legislative blueprint, not a finished rulebook. It tries to set the terms of debate, what Congress should regulate, what it should avoid, and which risks deserve priority. For practitioners and researchers, the real question is whether this blueprint translates into operational protections or stays at the level of messaging. At the “explainer” level, the document groups its recommendations into seven areas: child safety, community protections, copyright, free speech, innovation, workforce training, and federal preemption. That structure is useful because it shows what the administration wants Congress to touch first. But it also signals a trade-off: breadth over depth, where each section can point in a direction without specifying standards, thresholds, or enforcement muscle. The deepest structural claim is the push for federal preemption, the idea that AI rules should be primarily national, not state-by-state. In theory, one standard could reduce compliance chaos and make cross-state deployment simpler. In practice, preemption is not neutral: it decides whether state-level guardrails become a testing ground for better protections, or get wiped out before they mature. On child safety and community protections, the framework’s instincts are broadly aligned with what we would expect from a risk-based approach: prioritize the most vulnerable and the most scalable harms. Yet “protect children” can become a banner that hides hard design questions, age assurance, data minimization, safe defaults, and meaningful auditing. Without concrete requirements (what must be tested, logged, and independently verified), the language risks becoming aspirational while harmful systems remain deployable. The copyright section is where the framework’s “innovation-first” posture shows most clearly, leaning toward legal permissiveness around training while suggesting courts sort out key disputes. That approach may reduce friction for model development, but it pushes uncertainty downstream onto institutions buying or deploying tools, universities, hospitals, and startups. When provenance is unclear, we end up normalizing “trust us” procurement, which is a weak foundation for public legitimacy and research reproducibility. The free speech framing also does important signaling, but it can blur a crucial distinction: protecting expression is not the same as avoiding accountability for amplification, targeting, fraud, or high-impact deception. If the policy conversation collapses into “regulation versus speech,” we lose precision about what should be regulated: measurable harms, manipulative design patterns, and negligent deployment in sensitive contexts. A framework can defend rights while still demanding auditable safety behaviors from powerful systems. Where the second half of the framework and the broader “DEEP DIVE” conversation around it, still feels flou, is in the missing operational spine. We want clearer definitions (what counts as a high-risk system), clearer obligations (what testing is mandatory before deployment), and clearer governance (incident reporting, red-team standards, independent audits, and post-deployment monitoring). “Regulatory sandboxes” are not a substitute for baseline protections; without stop-rules and external oversight, sandboxes can become a faster lane to release rather than a safer lane to evaluate. Finally, the framework under-specifies the clinical and research reality: privacy is not a footnote, evaluation is not optional, and “workflow integration” is where safety either holds or collapses. If preemption reduces state pressure without replacing it with enforceable federal standards, we risk a vacuum where vendors set the bar and institutions quietly absorb the risk. The framework becomes strongest when it turns values into requirements, what we must test, document, disclose, and monitor, because that is the only way “national leadership” becomes something we can actually practice.

English

Talking to Spreadsheets: What It Really Means to Use ChatGPT for Excel Work That Must Stay Correct

In research and clinical settings, Excel persists because it is fast, familiar, and flexible. Screening logs, adverse-event trackers, clinic volume summaries, and quality-improvement datasets often begin (and sometimes remain) as spreadsheets. What feels newly consequential is the possibility of working through language: describing what we want done and having an AI system build, update, analyze, or troubleshoot while leaving the spreadsheet’s layout and formulas largely intact. For teams already burdened by documentation and reporting cycles, that shift is not trivial. From the perspective of an experienced clinician-researcher, the appeal is less “automation” than the reduction of brittle, time-consuming micro-tasks. A surprising amount of spreadsheet labor involves extending a pattern across sheets, repairing references after columns change, harmonizing date and text formats, or generating consistent summaries under time pressure. Natural-language interaction can serve as a specification layer over formula work: “Add a flag for missed visits using our existing definition,” or “Extend this table to include the new site without changing the report format.” When it works, it allows attention to return to design decisions rather than keystrokes. The preservation of formatting, formulas, and structure matters more than it may sound. Many real-world spreadsheets encode institutional memory in their structure: color conventions, locked cells, named ranges, hidden calculation tabs, and formulas that implement local definitions. An AI assistant that edits aggressively, rebuilding tables, flattening formulas, or rearranging columns, can break downstream use even if the “answer” looks correct. The practical requirement, therefore, is not only correctness of outputs but respect for the spreadsheet as a system with dependencies. Building and updating are often the safest entry points. Adding new calculated columns, generating data-validation rules, or creating a summary sheet can be done in ways that are auditable and reversible, especially if the assistant is instructed to place changes in a new tab or clearly marked area. In clinical audit work, for instance, a natural-language request to create a monthly run chart or pivoted summary can save time, but it should also produce formulas that are visible and checkable. The goal is not to hide the work, but to make it quicker to draft and easier to review. Error diagnosis is where benefits and risks rise together. Spreadsheet errors are typically quiet: a mixed absolute/relative reference, a SUMIF range that fails to extend to the newest rows, a text-to-number conversion that quietly produces zeros, or a lookup that breaks when identifiers change format. An AI system can often propose plausible causes and minimal corrections, which is genuinely helpful when a deadline is near. Yet “minimal” is contextual; even a one-cell fix can alter denominators, eligibility flags, or baseline values in clinically meaningful ways. Analysis through natural language can also change who participates in interpretation. Not everyone in a multidisciplinary team reads nested formulas comfortably, and that gap can concentrate power in the hands of the person who “knows Excel.” If an assistant can translate a request, “Summarize no-show rates by site and month, and show how missing values were handled”, into transparent steps and clearly labeled outputs, the spreadsheet becomes more legible. That legibility has practical value: better peer review of calculations and fewer analytic choices hidden inside formulas. Still, there are tensions that cannot be solved by interface design alone. Natural language is ambiguous, while spreadsheets are literal; “clean the data” can mean anything from trimming spaces to redefining categories. AI-generated formulas may look convincing while being subtly wrong, and summaries can miss artifacts such as duplicated rows, shifted time windows, or changes in coding practice. For this reason, the most responsible use is procedural: versioning, before/after reconciliation totals, spot checks on known cases, and separation of raw data from transformed and reported outputs. Ethically, the central issues are transparency, privacy, and accountability rather than novelty. If patient-identifiable data are sent to an external tool without appropriate governance, no level of convenience justifies the breach of trust. Even in secure enterprise environments, teams should be explicit about where AI was used, what was changed, and who approved the final dataset or report. Data integrity is an ethical commitment: it requires an audit trail, a clear division of responsibility, and a refusal to treat AI output as self-validating. In practice, I have found it helpful to treat AI assistance as a junior collaborator: useful for drafting transformations, proposing checks, and explaining formula logic, but not a substitute for methodological judgment. Asking the system to show its work, formulas, assumptions, handling of missingness—and constraining it to preserve structure can reduce unintended disruption. The more consequential the spreadsheet (clinical decisions, regulatory reporting, publishable results), the more stringent the validation should be. Used this way, natural-language tools can support reliability rather than merely speed. Looking forward, the most meaningful shift may be cultural. Natural-language interaction encourages us to articulate definitions (“What counts as a missed visit?” “Which date anchors follow-up?”) before encoding them in formulas. If we pair that articulation with disciplined verification, we may end up with spreadsheets that are not only faster to maintain but also easier to audit, teach, and trust. In clinical research, that combination, clarity plus accountability, is the real promise.

English

When Evidence Meets Interface: Wiley–OpenEvidence and the Next Step in Clinical Knowledge Access

In clinic, the question that stalls us is rarely the “textbook” one. It is the oddly specific patient in front of us, the medication interaction that is plausible but not obvious, the guideline nuance that changed last year, the subgroup analysis we half-remember but cannot reliably quote. Against that reality, Wiley’s March 3, 2026 announcement of a partnership with OpenEvidence feels less like a technology headline and more like a workflow intervention: a major publisher is licensing a deep medical corpus into a point-of-care AI system designed to retrieve, synthesize, and cite biomedical evidence under time pressure. What is notable is not simply that “AI is coming to medicine”, we have lived with clinical search tools and decision supports for decades, but that this agreement explicitly centers a familiar truth: good answers depend on the quality and integrity of the underlying literature. Wiley frames the problem in terms every clinician recognizes, namely the expanding volume of research and the persistent lag between publication and practical uptake. OpenEvidence, in turn, positions itself around evidence-grounded answering, aiming to keep clinicians close to citable sources rather than drifting into untraceable summarization. The scope described in the announcement is not trivial. It includes access to Cochrane content such as the Cochrane Database of Systematic Reviews and Cochrane Clinical Answers, alongside hundreds of Wiley peer‑reviewed journals and a broad collection of journals and books spanning multiple specialties. In principle, this matters because systematic reviews and structured clinical answers sit closer to the “actionable middle” of evidence-based practice, where trainees and clinicians often need synthesis that remains tethered to methods and citations. At the same time, the partnership makes visible a constraint that many end users misunderstand: licensing full text for computation does not automatically mean full text can be freely displayed. In publisher ecosystems, the version of record is governed by copyright and sharing policies, and the practical result is often that a platform can analyze full text internally while presenting users with references, links, and limited quotations rather than reproducing articles in full. This arrangement protects intellectual property but also creates a pedagogical tension, because clinicians and learners may feel they are being asked to trust a summary without immediate access to the complete argument and methods. From a clinical workflow perspective, the promise is speed without surrendering traceability. If an AI tool can answer, “What is the evidence for X in population Y?” and immediately point us to the most relevant systematic review, pivotal trial, or clinical reference text, ideally with enough context to judge applicability, it can reduce the low-value time we spend searching across interfaces. In practice, the difference is not merely convenience: it can preserve cognitive bandwidth for the work that only humans can do, such as integrating comorbidities, patient values, feasibility, and local resource constraints. For medical students and trainees, we understand the instinct to begin with general-purpose chat systems. Tools like ChatGPT, Gemini, and Claude can be helpful for tutoring, clarifying concepts, and organizing study plans. The problem is that fluency can masquerade as reliability, and in medicine a plausible-sounding answer that cannot be audited is not a small error, it is a liability. The responsible posture is to treat general systems as drafting or learning aids, while treating evidence-seeking as a different category of task that demands citations, provenance, and the ability to verify claims against primary sources. This is where specialized platforms may be more appropriate, not because they are “perfect,” but because their design incentives can be better aligned with evidence-based practice. A system built for medical Q&A that is intended to ground responses in peer‑reviewed literature and expose a clear citation trail supports how we teach and practice: ask, acquire, appraise, apply, and reassess. In our teaching settings, we often emphasize that the goal is not an “answer” but an answer with an audit trail, something a learner can defend at the bedside and a clinician can revisit when circumstances change. We should also acknowledge the limitations and tensions that accompany publisher-integrated AI. Any licensed corpus has edges: what is included, what is excluded, which specialties are best represented, and which years or formats are more accessible. If a system’s strongest access is concentrated within particular publishing portfolios, the retrieval layer may preferentially surface those sources unless balancing is explicit and measurable. And, of course, the biomedical literature itself contains publication bias, changing standards, and uneven global representation, meaning that “more content” does not automatically produce better clinical judgment. Ethically, partnerships like this heighten responsibilities around transparency, accountability, and data integrity. We should expect clear communication about what content is being searched, how citations are selected, and how uncertainty is handled, particularly when evidence is weak or conflicting. We also need institutional clarity about privacy: trainees must not paste identifiable patient details into external tools unless the platform is formally approved, secured, and governed. The ethical north star is not to celebrate AI or reject it, but to demand that AI-supported workflows preserve human responsibility and keep the evidence chain visible. Looking forward, we can read this collaboration as an early signal of a broader shift: publishers recognizing that discovery is moving from static databases toward interactive evidence interfaces, and AI platforms recognizing that trust depends on licensed, curated, peer‑reviewed foundations. For clinicians, researchers, and graduate learners, the opportunity is real, faster access to better-grounded synthesis at the point of need. The obligation is equally real: to read beyond the summary when stakes warrant it, to appraise what we retrieve, and to insist that “AI-supported” never becomes a substitute for clinical reasoning or scholarly discipline.

English

Original words, reused ideas: the reality of AI‑generated research plagiarism

Not everything that looks new is truly new. Generative AI can be a real productivity boost in research, useful for outlining, rewriting, translating, and even brainstorming when you’re stuck. But the same fluency that makes AI outputs feel “publishable” can also hide a quieter risk: AI-generated work can unintentionally recycle existing scholarship, not always through copied sentences, but through reused ideas, familiar argumentative scaffolds, replicated study logic, and look‑alike novelty. This is the reality many researchers are now facing: the integrity problem is often not blatant plagiarism, but hidden reuse, where the writing is original in wording while the intellectual contribution is thin, derivative, or too close to what already exists. When that happens, “novelty” becomes a surface effect, and research integrity becomes less about catching copied text and more about protecting genuine originality, traceable reasoning, and honest contribution. A strong example comes from Gupta and Pruthi’s ACL 2025 study, All That Glitters is Not Novel: Plagiarism in AI Generated Research (an Outstanding Paper award at ACL 2025). Their focus isn’t simple copy-paste plagiarism, but a more subtle kind: research-style text that rephrases or recombines prior work in ways that can look “new” during a fast review process. Instead of asking reviewers to judge novelty in the usual way, the authors designed an expert-led setting where participants were explicitly tasked to look for plagiarism sources. They had experts evaluate 50 LLM-generated research documents (including documents from “The AI Scientist” and other public proposals, plus newly generated ones). This matters because typical evaluations often assume good faith and don’t incentivize active source-hunting. They also used a clear rubric: the top scores corresponded to cases where there’s essentially a one-to-one mapping between the generated methodology and earlier work, or where substantial parts are borrowed from a small set of prior papers without credit. In other words, it’s not about identical sentences, it’s about the intellectual skeleton of the method and contribution being too close to something that already exists. The headline result is hard to ignore: experts flagged 24% of the 50 documents as plagiarism (scores 4–5) after verification steps that included contacting original authors; if you also count cases where verification wasn’t possible (e.g., authors unreachable), the rate rises to 36%. That gap is important, because it shows how “confirmed” cases may still be an undercount when real-world verification is slow or impossible. This is exactly why AI-era plagiarism can feel different: the risk often sits at the idea level, problem framing, method pipeline, and contribution claims, rather than in identical phrasing. If a proposal is written confidently, packaged with clean sections, and sprinkled with plausible citations, it can pass a quick surface check even when the underlying concept is not truly original. The study also highlights a second problem: automation doesn’t save us (yet). The authors report that several automated approaches, including embedding-based search and a commercial plagiarism service, were inadequate for detecting plagiarism in these LLM-generated proposals. That’s consistent with a broader reality: “semantic borrowing” is much harder to catch than overlapping strings of text. For peer review, this creates a nasty workload tradeoff. If AI increases the volume of polished submissions while also increasing the probability of hidden borrowing, reviewers must spend more time doing detective work, searching literature, mapping methods, and checking whether “novel contributions” are just renamed versions of known ideas. That pressure doesn’t just slow review; it can also push reviewers toward shallow heuristics, which makes the system even easier to game. For writers who use AI ethically, the safest mindset is: AI can help you express your ideas, but it should not be the source of your contribution. Keep a “provenance trail”: what you read, what you copied into notes (with quotes), and what you personally decided. If AI suggests a method or framing, treat it like an untrusted hint, then verify by searching for prior work and adding explicit citations where your idea connects to existing literature. For universities, journals, and conferences, the response shouldn’t be panic, it should be process upgrades. Require transparent disclosure of AI use, strengthen novelty checks (especially at the idea/method level), and give reviewers tools/time to do targeted source-searching when something feels “too clean.” Most importantly, reward careful citation and honest positioning (“this is an extension of X”) rather than over-marketing novelty, because in the AI era, exaggerated novelty is becoming easier to manufacture than real research progress. Finally, this is where human reviewers and human judgment remain essential, and where responsibility can’t be outsourced. Because idea-level reuse is subtle, it often takes domain expertise to notice when a “new” pipeline is really a renamed or lightly rearranged version of established work. In other words, integrity in the AI era depends less on automated flags and more on careful reading, source-checking, and accountable editorial processes. The same caution applies in clinical contexts: many AI tools on the market are still not reliable enough to be treated as clinical-grade systems, and they can produce confident but wrong, biased, or unsafe outputs. In therapy settings especially, we should treat AI as supportive workflow software, not an authority, keeping clinicians firmly responsible for interpretation, decisions, and client safety.

English

When Algorithms Meet Interaction : A Clinician‑Researcher’s Critical Reading of ESLA’s 2026 AI Position Paper

In a busy speech and language therapy week, “AI” doesn’t arrive as a philosophical debate. It arrives as a friction point: Can we trust automated transcription for a multilingual child? Should we trial a screening app to reduce waiting lists? Can a generative tool draft home-program instructions without drifting beyond evidence? ESLA’s position paper, Shaping the Future of Speech and Language Therapy with Artificial Intelligence (published March 6, 2026), lands right in this reality. It acknowledges that AI is rapidly reshaping healthcare, and it correctly frames SLT as a profession built on interaction, context, and judgement. The problem is that when clinicians turn to a position paper, many of us aren’t only looking for a vision, we’re looking for direction. To ESLA’s credit, the paper avoids the two common extremes: adopting tools because they’re new, or rejecting them because they feel risky. It states a sensible professional stance: AI should support rather than replace therapists, and implementation must respect human rights, dignity, diversity, and inclusion. But here’s the tension: those principles are true, and still too general to steer day-to-day decisions. In practice, “support not replace” needs to translate into operational boundaries: Which tasks are acceptable to automate? Which tasks must remain clinician-led? What level of human oversight is “enough” in real services under pressure? Without that, the document risks becoming a shared set of values that everyone agrees with, while actual practice drifts in whichever direction vendors, budgets, or time constraints push it. ESLA outlines opportunities (screening, assessment support, intervention planning, outcome monitoring, personalised care, hybrid delivery, access). Again, plausible, and in many areas, exciting. Yet the paper could have pushed further into the uncomfortable practicalities: the reality that “personalisation” can become “fit to the majority,” especially for multilingual children, minoritised dialects, AAC users, complex disability profiles, and culturally specific communication norms. If ESLA’s goal is equity, we need stronger statements on minimum evidence expectations, not just aspirations. What counts as sufficient validation for multilingual speech recognition? What sample sizes? What types of accents/dialects? What error rates are acceptable and acceptable for whom? Otherwise, “equity” stays a hope rather than a measurable requirement. Where the paper feels strongest is its ethical backbone, privacy, consent, data ownership, bias, unequal access, and over-reliance. But ethically strong does not automatically mean clinically usable. This is where many clinicians will say: It’s nice, but… c’est flou encore. We wanted more “what to do / what not to do.” We wanted a toolkit. We wanted ESLA to name the concrete risks that show up in real workflows: staff uploading voice samples into unknown cloud services; student clinicians pasting identifiable details into chatbots; managers adopting “AI screening” because it’s cheaper, then letting it quietly become gatekeeping. A position paper can’t solve everything, but it can draw bright lines and this one often speaks in principles where the field is asking for guardrails. What would make this paper more actionable is a clinician-facing set of guidelines that reads like a decision support tool, not a manifesto. For example: a “red / amber / green” use-case list (e.g., green = admin summarisation with de-identified data; amber = draft therapy materials with clinician review; red = automated diagnosis/eligibility decisions). A privacy verification checklist (Where is data stored? Is it used to train models? Can we opt out? Who is the data processor/controller? Is encryption stated? What is retention/deletion policy? Can we get a Data Processing Agreement? Is there audit logging?). A minimum evaluation standard (evidence thresholds, bias testing expectations, multilingual performance reporting, disclosure requirements). And even a “vendor questions” one-pager managers can use before procurement. Without these, professional leadership remains a principle, but clinicians and services are left to invent policy locally, inconsistently, and often under time pressure. ESLA’s conclusion calls for deliberate, collaborative, ethical adoption so AI strengthens participation, dignity, and inclusion. That’s a good direction. But the next step should be explicit: an implementation appendix that turns values into practice. Because the profession doesn’t just need permission to engage with AI, we need structure to do it safely. Until then, ESLA’s paper functions as a strong ethical positioning statement, but not yet as the practical compass many of us hoped for. In other words, the paper is important, timely, and well-intentioned, but it still needs more actionable guidance for real-world practice.

English

ChatGPT 5.3 (“Instant”) vs 5.4 (“Thinking”) for Therapists: What Helps, What Hurts, and What Must Come First

In research and in therapy, the hardest part of the week is often not the “clinical moment” or the “research idea” itself. It’s the accumulation of in-between work: turning rough notes into coherent formulations, translating technical concepts into language a client can live with, and documenting care in a way that is both faithful and legible. Against that reality, the most useful way to approach ChatGPT 5.3 and 5.4 is not as a question of novelty, but as a question of workflow. What can these tools responsibly reduce, and what new risks do they introduce? Before you compare speed or “intelligence,” ask the compliance question readers actually need answered: is this tool HIPAA‑approved for PHI, or not? HIPAA doesn’t officially certify software as “approved”; in practice, the baseline is whether you can get a Business Associate Agreement (BAA) and enforce the right retention/access controls. OpenAI states that using its API with PHI requires a BAA, and that a BAA for ChatGPT is currently available only to sales‑managed ChatGPT Enterprise or Edu customers, not to ChatGPT Business. So if you can’t contractually and operationally defend how patient data is handled, you shouldn’t treat the system as an extension of your clinical workspace, whether you’re in the U.S. under HIPAA or elsewhere under professional ethics and local privacy expectations. Once guardrails are clear, the 5.3/5.4 split becomes genuinely useful. ChatGPT 5.3 (“Instant”) reads like a model tuned for everyday flow: fast, smooth, and often excellent at turning messy ideas into clean language. In a clinical workflow, that matters because it targets the small tasks that become overwhelming at scale, drafting psychoeducation, rewriting intake instructions, simplifying coping-skill explanations, or making client-facing language less technical. It can also quickly produce several variants so you can choose the tone that fits a client’s age, culture, and readiness. The clinical value is real, but it’s also deceptively easy to misuse. When a model produces confident prose quickly, it can blur the boundary between translation (changing style while preserving meaning) and invention (quietly filling gaps with plausible-sounding content). That boundary matters because clinical documents are not just “writing”, they’re records that shape continuity of care, reimbursement, supervision, and how clients are understood by other systems. If you use 5.3 for notes or summaries, the safest stance is: it drafts language; you own the facts. ChatGPT 5.4 (“Thinking”) tends to be most helpful when the task has internal structure, where you need the tool to keep track of dependencies across steps rather than just rewrite a paragraph. In research, that might mean tightening a methods section without changing meaning, drafting a transparent analysis-plan template, or turning scattered meeting notes into a protocol outline you can critique. In clinical contexts, it can help you generate alternative hypotheses, map maintaining factors, or create decision-tree prompts for your own reflection. The benefit isn’t “it knows therapy,” but that it can help you organize complexity when you’re fatigued. That same coherence, however, introduces a specific kind of risk: a “complete-sounding” narrative that feels psychologically compelling even when it’s not accountable to the actual relationship, the client’s culture, or your observed data. 5.4 can produce formulations that read impressively integrated, yet still be subtly wrong, overly confident, or prematurely interpretive. In therapy, that can show up as false precision: labels that land too early, causal stories that are too tidy, or summaries that inadvertently overwrite the client’s own meanings. The more elegant the output, the more vigilant the clinician needs to be. Pricing matters because it quietly determines who uses what, how often, and under which safeguards, and that, in turn, shapes clinical risk. OpenAI’s consumer tiers are commonly framed as Go (~ $ 8/month in the U.S.), Plus ( $ 20/month), and Pro ( $ 200/month), while ChatGPT Business is listed at ** $ 25 per user/month billed annually** (and the pricing page advertises “Try for free”). Prices can also be region-dependent, so what you see in Lebanon may differ from U.S. stickers. The honest clinical question, then, is not “Which is cheapest?” but: Which plan aligns with our confidentiality requirements and governance? Lower-cost personal tiers can nudge clinicians toward “quick personal account” use (often with weaker admin controls), while Business/Enterprise tiers are designed for organizational controls, yet they still come with practical constraints (e.g., “unlimited” is typically subject to abuse guardrails, and some tiers have product/feature caveats; Go may include ads). A realistic way to use these tools well is to divide tasks by risk level. Low-stakes, high-value uses include drafting client handouts (without identifiers), rewriting psychoeducation, generating multiple wording options for a message you will review, or creating session structure templates. Higher-risk areas, diagnosis, risk assessment, duty-to-warn decisions, and definitive case formulation, should remain human-led, with the tool used only to support clarity of communication, not clinical judgment. The goal is not to “ban” the tool; it’s to keep it in a role that improves quality without quietly shifting responsibility. If there’s one operational lesson, it’s that these tools are best treated like exceptionally fluent interns: fast, helpful, and sometimes brilliant at presentation, but not accountable for truth. They can reduce friction and widen your drafting bandwidth, especially during heavy weeks, but only if you design a workflow where verification is normal rather than optional. In other words, the right question is not “Can it write this?” but “Can I audit what it wrote, and can I defend it ethically and clinically?” Ethically, the most important moment is often when the tool feels easiest to use. Transparency is not only a client-facing issue (“AI helped draft this handout”) but a professional integrity issue: what was delegated, what was verified, and what data entered the system. Responsibility means refusing to outsource clinical judgment, risk assessment, diagnosis, or case formulation, to a tool that cannot carry a duty of care. Data integrity, in turn, means keeping documentation anchored to observed clinical facts and the client’s words, and using governance-minded frameworks to decide what is appropriate to automate, what must remain human,

English

When “It Feels Private” Isn’t the Same as “It Is Private”: Teens, Chatbots, and the New Privacy Gap

In our clinical and research work with adolescents, privacy is not a side issue, it is a developmental workspace where autonomy, identity, and boundaries are tested in real time. The American Psychological Association (APA) has emphasized that wanting privacy is normal and healthy in adolescence, especially when teens are exploring identity, relationships, and new emotions. When privacy is treated as inherently suspicious, we often intensify secrecy rather than strengthen judgment. At the same time, we are seeing a shift in where privacy is practiced. Many teens now process questions and feelings in conversations with AI chatbots. As Dr. Joshua Goodman notes, “Conversations with AI chatbots can feel like a safe space” for adolescents to discuss things they would not feel safe bringing to parents or others. In practice, that “safe space” feeling often comes from immediacy, lower embarrassment, and the absence of visible adult worry. The interface can feel calm, nonreactive, and available, especially for teens who expect criticism or conflict. From a therapist’s standpoint, this is precisely why we cannot treat chatbot use as a trivial trend or a purely parental concern. We have a professional responsibility to follow adolescents’ real help-seeking pathways, including the digital ones, and to make sure that what feels private is not mistaken for what is protected. If a teen is using a “kid chatbot” as their primary container for shame, identity questions, sexual concerns, trauma narratives, or self-harm thoughts, then chatbot use becomes clinically relevant behavior. It belongs in assessment, formulation, and ongoing treatment planning the same way we track sleep, peer relationships, substance use, and social media exposure. The central clinical concern is simple: perceived safety is not the same as real safety. Dr. Mary Alvord states it directly: “Perceived safety isn’t the same as real safety.” When a chatbot feels validating and discreet, teens may assume it is confidential in the way a therapeutic relationship is meant to be. But teens often use the word “private” to mean “no one will judge me,” “no one will get angry,” or “no one will make it awkward.” Data privacy asks different questions: who can access the content, whether it is stored, whether it is analyzed, and how it might be reused. This is where our clinical role is not to frighten teens, but to clarify reality. Psychoeducation about privacy is not a parent lecture; it is a therapeutic intervention. Just as we help adolescents build mental models of consent, emotional regulation, risk, and relationships, we can help them build a mental model of digital disclosure. A chatbot can be emotionally comfortable while still being structurally nonconfidential. An interface can feel intimate without offering the protections we associate with private human relationships. Importantly, the therapist’s task is not only to warn; it is to understand the function the chatbot serves. In sessions, we can ask: What platform do you use? When do you turn to it? What do you ask it for—comfort, advice, distraction, validation, “permission,” reassurance? How do you feel afterward—relieved, calmer, more distressed, more confused, more dependent on reassurance? These questions help us assess whether the chatbot is supporting coping or quietly strengthening avoidance, rumination, or isolation. They also help us identify a common pattern in adolescence: the teen is not necessarily choosing a chatbot over people; they may be choosing it over anticipated judgment. Because we hold a duty of care, we also have a responsibility to name the “privacy gap” explicitly: many adolescents do not know what happens to their data. They may assume that because the interaction is one-on-one and “sounds caring,” it is private in the same way that a clinician’s office is private. Our role is to bring the invisible layer into view, gently, concretely, and repeatedly, so that teens can make informed choices rather than emotionally-driven assumptions. This is especially important with younger users and “kid” versions of chatbots, where the design may inadvertently amplify the sense of safety while the teen’s understanding of data practices is still developing. One effective approach is to translate privacy into a skill-building exercise rather than a moral warning. In practice, we can guide adolescents to sort what they might share with a chatbot into clear categories: low-stakes content (study help, hobbies, neutral questions), personally identifying details (full name, school, address, phone number, account handles), and high-sensitivity disclosures (sexual content, self-harm thoughts, trauma narratives, family conflict, screenshots of private messages, and medical or mental health details). The goal is not to ban curiosity or punish disclosure. The goal is to build discrimination: which topics can tolerate a lower-protection environment, and which topics deserve a safer channel—trusted adults, clinicians, or crisis supports when needed. From there, we can teach a practical rule that adolescents can actually use: the more specific and identifiable the details are, the higher the potential cost if the content is stored, misunderstood, shared, or later accessed. We can also strengthen what developmental psychology already tells us adolescents are practicing: “future-self thinking.” We can ask: How would you feel if a parent, peer, school, or future partner saw this? Would you still want it written down somewhere? Could it be taken out of context? This anticipatory reflection is developmentally appropriate because foresight and risk appraisal are still consolidating in adolescence. Practiced regularly, it turns privacy from an abstract value into a usable decision-making habit. An ethical reflection is unavoidable because the issue is not only teen behavior, but also system design and adult responsibility. Teens and caregivers deserve transparent explanations of what chatbots can and cannot promise, especially when the interface feels intimate. As clinicians, we cannot outsource clinical judgment, or the protection of vulnerable disclosures, to systems without human accountability and professional ethics. Yet we also have to hold complexity: some teens have limited access to responsive adults, and chatbots may function as a temporary bridge. Our responsibility, then, is not to deliver blanket approval or bans. It is to stay close to adolescents’ real lives, to ask about chatbot use without shaming, to clarify the

English

Quand « ça semble privé » n’est pas la même chose que « c’est privé » : adolescents, chatbots et le nouvel écart de confidentialité

Dans notre travail clinique et de recherche auprès des adolescents, la vie privée n’est pas une question annexe ; c’est un espace de développement où l’autonomie, l’identité et les limites sont testées en temps réel. L’American Psychological Association (APA) a souligné que le désir d’intimité est normal et sain à l’adolescence, en particulier lorsque les jeunes explorent leur identité, leurs relations et de nouvelles émotions. Lorsque la confidentialité est considérée comme intrinsèquement suspecte, nous avons souvent pour effet d’accroître le secret plutôt que de renforcer le jugement. Dans le même temps, nous observons un déplacement des lieux où s’exerce cette intimité. Beaucoup d’adolescents traitent désormais leurs questions et leurs émotions dans des conversations avec des chatbots d’IA. Comme le note le Dr Joshua Goodman, « Les conversations avec des chatbots d’IA peuvent donner aux adolescents l’impression d’un espace sûr » pour aborder des sujets qu’ils n’oseraient pas apporter à leurs parents ou à d’autres. En pratique, ce sentiment « d’espace sûr » vient souvent de l’immédiateté, d’une moindre gêne et de l’absence d’inquiétude visible chez l’adulte. L’interface peut paraître calme, non réactive et disponible, surtout pour des adolescents qui s’attendent à de la critique ou au conflit. Du point de vue du thérapeute, c’est précisément pourquoi nous ne pouvons pas traiter l’usage des chatbots comme une tendance anodine ou une préoccupation uniquement parentale. Nous avons la responsabilité professionnelle de suivre les véritables voies de recherche d’aide des adolescents, y compris numériques, et de nous assurer que ce qui « semble » privé n’est pas confondu avec ce qui est protégé. Si un adolescent utilise un « chatbot pour enfants » comme principal contenant pour la honte, les questions d’identité, les préoccupations sexuelles, les récits de traumatismes ou les pensées d’auto‑agression, alors l’usage du chatbot devient un comportement cliniquement pertinent. Il a sa place dans l’évaluation, la formulation et la planification thérapeutique continue, tout comme nous suivons le sommeil, les relations avec les pairs, l’usage de substances et l’exposition aux réseaux sociaux. La préoccupation clinique centrale est simple : la sécurité perçue n’est pas la sécurité réelle. La Dre Mary Alvord le dit clairement : « La sécurité perçue n’est pas la sécurité réelle. » Lorsqu’un chatbot paraît validant et discret, les adolescents peuvent supposer qu’il est confidentiel comme une relation thérapeutique est censée l’être. Or, les adolescents emploient souvent le mot « privé » pour signifier « personne ne me jugera », « personne ne se mettra en colère » ou « personne ne rendra la situation gênante ». La protection des données pose d’autres questions : qui peut accéder au contenu, s’il est stocké, s’il est analysé et comment il pourrait être réutilisé. C’est ici que notre rôle clinique n’est pas d’effrayer les adolescents, mais d’éclairer la réalité. La psychoéducation sur la confidentialité n’est pas une leçon parentale ; c’est une intervention thérapeutique. De la même manière que nous aidons les adolescents à construire des modèles mentaux du consentement, de la régulation émotionnelle, du risque et des relations, nous pouvons les aider à bâtir un modèle mental de la divulgation numérique. Un chatbot peut être confortable sur le plan émotionnel tout en restant non confidentiel sur le plan structurel. Une interface peut sembler intime sans offrir les protections que nous associons aux relations humaines privées. Il est important de rappeler que la tâche du thérapeute n’est pas seulement de mettre en garde ; il s’agit aussi de comprendre la fonction que sert le chatbot. En séance, nous pouvons demander : Quelle plateforme utilises‑tu ? À quels moments y recours‑tu ? Qu’en attends‑tu réconfort, conseils, distraction, validation, « permission », rassurance ? Comment te sens‑tu ensuite, soulagé, plus calme, plus en détresse, plus confus, plus dépendant de la rassurance ? Ces questions nous aident à évaluer si le chatbot soutient l’adaptation ou, insidieusement, renforce l’évitement, la rumination ou l’isolement. Elles nous aident aussi à repérer un schéma fréquent à l’adolescence : le jeune ne choisit pas nécessairement un chatbot plutôt que des personnes ; il le choisit plutôt pour éviter un jugement anticipé. Parce que nous avons une obligation de protection, nous avons aussi la responsabilité de nommer explicitement « l’écart de confidentialité » : nombre d’adolescents ignorent ce qu’il advient de leurs données. Ils peuvent supposer que, parce que l’interaction est en tête‑à‑tête et « donne une impression de bienveillance », elle est privée au même titre qu’un cabinet de clinicien. Notre rôle est de rendre cette dimension invisible plus concrète, avec douceur, concrètement et de façon répétée, afin que les adolescents puissent faire des choix éclairés plutôt que de s’en remettre à des suppositions dictées par l’émotion. Cela est particulièrement important chez les plus jeunes utilisateurs et dans les versions « enfant » des chatbots, où la conception peut involontairement amplifier le sentiment de sécurité alors même que la compréhension des pratiques de données est encore en développement. Une approche efficace consiste à traduire la confidentialité en un exercice de développement de compétences plutôt qu’en un avertissement moral. En pratique, nous pouvons guider les adolescents pour trier ce qu’ils pourraient partager avec un chatbot en catégories claires : contenus à faible enjeu (aide aux devoirs, loisirs, questions neutres), éléments personnellement identifiants (nom complet, établissement scolaire, adresse, numéro de téléphone, identifiants de comptes) et divulgations hautement sensibles (contenu sexuel, pensées d’auto‑agression, récits de traumatismes, conflits familiaux, captures d’écran de messages privés, ainsi que données médicales ou de santé mentale). L’objectif n’est pas d’interdire la curiosité ni de culpabiliser les jeunes lorsqu’ils partagent trop. L’objectif est de développer le discernement : quels sujets peuvent un environnement moins protecteur, et quels sujets méritent un canal plus sûr, adultes de confiance, cliniciens, ou dispositifs de crise si nécessaire. À partir de là, nous pouvons enseigner une règle pratique réellement utilisable par les adolescents : plus les détails sont précis et permettent d’identifier la personne, plus le risque potentiel est élevé si le contenu est stocké, mal compris, partagé ou consulté ultérieurement. Nous pouvons également renforcer ce que la psychologie du développement nous apprend déjà que les adolescents exercent :

Shopping Cart