
A handheld ultrasound probe is a regulated medical device long before it is a product. Between the bench prototype and the first sale sits a stack of standards that govern how much energy the probe may emit, how hot it may run against skin, what its materials may leach, how its image crosses a wireless link, and which national door it walks through. The dull lines in this file fail probes far more often than the headline features do.
None of these limits appear on a spec sheet, and every one of them can stop a shipment.
Three IEC documents stack on top of each other to govern acoustic safety, and a handheld maker meets all three or none. IEC 60601-2-37 is the particular standard for ultrasonic diagnostic equipment, setting the rules for the displayed mechanical and thermal indices, the transducer surface-temperature ceiling, and the default display state at power-on. The mechanical index it governs is the peak rarefactional pressure in megapascals divided by the square root of the centre frequency, a proxy for cavitation risk, and the thermal index splits three ways for soft tissue, for bone at the focus, and for bone near the surface. Beneath that standard, IEC 62359 defines how the field is measured: a calibrated hydrophone scans the beam in a water tank, the pressure waveform is captured point by point, and the derated intensities are computed with the 0.3 decibel per centimetre per megahertz model that converts a flattering tank reading into an estimated in-situ value. IEC 61157 then fixes the format in which those numbers are declared, letting a buyer compare two probes on one basis. The chain runs measurement, then limit, then declaration, and a probe that images beautifully while its hydrophone data is missing or out of tolerance carries no certifiable acoustic case, however good the picture looks on the phone.
Measure, then limit, then declare; drop one link and the chain holds no certificate.
ISO 14971 is the spine the whole file hangs from. It demands a documented chain from every hazard to a control and from every control to evidence, and the acoustic, thermal, electrical, and software risks each trace back to it. ISO 10993 governs the other contact question. The lens, the housing, and any gel-contact surface face testing for cytotoxicity, sensitisation, and irritation, since a probe presses against skin and sometimes mucosa across years of use. A material that passes inside a vented cart casing can fail in a sealed handheld where trapped heat and repeated cleaning chemistry age it faster, and the test that matters is the one run on the final material in its final form. A change of adhesive, a new pigment in the housing, or a switched supplier can reopen the biocompatibility question, and a quiet substitution on the production line is the kind of change that voids a clearance without anyone noticing until an audit.
Software and usability each became their own hazard class. A probe whose image lives in a phone app inherits the rules for medical-device software, version control, and cybersecurity, and a defect that freezes a frame or corrupts a measurement is a patient-safety event rather than a bug ticket. A probe that lets a tired clinician confirm a critical action with one careless tap, or buries the freeze control three menus deep, fails a use-related risk analysis even with flawless electronics. The usability-engineering standard treats foreseeable misuse as a design input, and a handheld probe used in a noisy resuscitation has less margin for an ambiguous button than a console in a quiet reading room.
The same probe meets three different gatekeepers depending on where it sells. In Europe, the EU MDR 2017/745 classifies an active diagnostic ultrasound device through its Rule 11 software and energy provisions, pushing it to Class IIa or higher and pulling in a notified body, a technical file, a unique device identifier, and post-market surveillance the old directive never demanded. In the United States, the FDA 510(k) Track 3 route asks the maker to show substantial equivalence to a legally marketed predicate device and to hold the acoustic output below the Track 3 ceilings, the 720 milliwatt per square centimetre derated intensity for general work and far lower for ophthalmic presets. In China, NMPA Class II registration runs its own type testing at a recognised laboratory and a clinical-evaluation path through a domestic standard set that mirrors the IEC family without copying it line for line. One probe, identical hardware, three dossiers, three timelines, three fee schedules, and three review queues that move at their own pace. A maker that designs for one door and bolts the others on late pays for the shortcut in resubmissions and lost quarters. The disciplined sequence builds one master technical file and maps each market’s annex onto it, and a single hydrophone dataset with one risk file then serves all three submissions rather than three parallel test campaigns.
One probe, three doors. The hardware is identical and the paperwork is not.
A wireless probe turns every study into data in transit, and two more standards govern that journey. DICOM 3.0 fixes how the image is tagged, stored, and pulled into a hospital record, and a probe whose app exports a flat picture with no patient metadata hands the radiology department a filing problem it never agreed to. IEC 60601-1-2 governs electromagnetic compatibility, and a probe carrying a radio has to prove it neither emits interference into neighbouring equipment nor falls over when a nearby device emits its own. A probe can clear every acoustic limit and still fail on one of these. A wireless link that leaks patient data, or an app that parks studies outside an access-controlled archive, turns a clean imaging device into a privacy liability the hospital inherits, and current MDR and FDA guidance treats that data path as part of the device rather than an accessory.

A notified body or an FDA reviewer does not retest the probe from scratch. The reviewer reads the file the maker built and checks that every claim has a measurement, every measurement a method, and every method a calibrated instrument behind it. A declared mechanical index points to a hydrophone scan, the scan points to a calibration certificate, and the certificate points to a national metrology institute. Break that traceability anywhere and the number becomes an assertion rather than evidence. The same logic runs through the biocompatibility data, the electromagnetic test reports, and the software lifecycle records. The strongest probe on the bench loses to the weaker probe with the cleaner file, since the review judges the documentation, and the documentation is the only object the reviewer can hold. A maker that treats the file as an afterthought assembled at the end, rather than a record kept from the first prototype, discovers the gaps during the audit, when closing them costs a full resubmission cycle and the months that come with it.
Several headline numbers on a probe are engineering choices the standards then box in. The piezoelectric array, 128 against 192 elements, sets the spatial sampling of the aperture, and a denser array resolves finer lateral detail at the cost of more channels to drive and more heat to shed. The beamformer running 64 against 32 channels decides how cleanly those elements focus, since each channel applies its own calculated time delay to steer and converge the beam, and a thin count smears the image at the edges of a steered sector and shortens the usable dynamic range. Display resolution matters as much as acquisition: a 256 against 128 gray-scale rendering separates tissue planes a coarser scale collapses into one shade, and a frame rate near 24 frames per second keeps a moving valve or a sliding pleura readable rather than strobed, with line density and depth trading against that rate in real time. These four figures decide whether the picture is diagnostic at the bedside, and none of them is the megahertz number the brochure leads with. A reviewer reads each one as a claim that needs test evidence, and a buyer should read them as the numbers that decide whether a fast scan is also a readable one.
The last group is where the wireless form pays its tax in full. The signal reaches the screen over WiFi 802.11n dual band, a hop that frees the probe from any cradle and adds tens of milliseconds of latency the cart never carried, or over a USB-C tether against WiFi that swaps the radio for a wire and a fixed host with no dropped frames. The sealed shell has to keep fluid out, and the gap between an IP68 and an IPX5 rating is the difference between a probe rated for immersion at a metre for thirty minutes during disinfection and one that only shrugs off a splash from any angle. Inside, power and heat fight over one budget: a dual against single battery design fixes how long a clinician scans before swapping, the reason a high-frequency probe runs hot and a cooling coating carries heat off the lens before it breaches the 43-degree surface limit, and a working-while-charging feature keeps the probe live across a long list at the cost of a warmer cell the risk file has to account for. Each one is a line in the same dossier, and a clinic that scans for hours meets these four choices long before it ever notices the megahertz rating on the box.
Twenty checks, one shipment. Any single failure holds the container at the port.
Reading the file in this order matters because the standards lean on each other. The acoustic measurement feeds the displayed indices, the indices feed the risk file, the risk file feeds the market submission, and the submission decides the label a clinician finally reads. The cheapest place to catch a problem is the bench, not the notified-body audit eighteen months later. A defect found at the prototype stage is a quiet redesign; the same defect found at the audit is a market entry pushed past its launch window and a queue of questions that runs for months.
Read the file, never the brochure. The probe that ships is the one whose paperwork holds.