Questions and Answers ​in MRI
  • Home
  • Complete List of Questions
  • …Magnets & Scanners
    • Basic Electromagnetism >
      • What causes magnetism?
      • What is a Tesla?
      • Who was Tesla?
      • What is a Gauss?
      • How strong is 3.0T?
      • What is a gradient?
      • Aren't gradients coils?
      • What is susceptibility?
      • How to levitate a frog?
      • What is ferromagnetism?
      • Superparamagnetism?
    • Magnets - Part I >
      • Types of magnets?
      • Brands of scanners?
      • Which way does field point?
      • Which is the north pole?
      • Low v mid v high field?
      • Advantages to low-field?
      • Disadvantages?
      • What is homogeneity?
      • Why homogeneity?
      • Why shimming?
      • Passive shimming?
      • Active shimming?
    • Magnets - Part II >
      • Superconductivity?
      • Perpetual motion?
      • How to ramp?
      • Superconductive design?
      • Room Temp supercon?
      • Liquid helium use?
      • What is a quench?
      • Is field ever turned off?
      • Emergency stop button?
    • Gradients >
      • Gradient coils?
      • How do z-gradients work?
      • X- and Y- gradients?
      • Open scanner gradients?
      • Eddy current problems?
      • Active shielded gradients?
      • Active shield confusion?
      • What is pre-emphasis?
      • Gradient heating?
      • Gradient specifications?
      • Gradient linearity?
    • RF & Coils >
      • Many kinds of coils?
      • Radiofrequency waves?
      • Phase v frequency?
      • RF Coil function(s)?
      • RF-transmit coils?
      • LP vs CP (Quadrature)?
      • Multi-transmit RF?
      • Receive-only coils?
      • Array coils?
      • AIR Coils?
    • Site Planning >
      • MR system layout?
      • What are fringe fields?
      • How to reduce fringe?
      • Magnetic shielding?
      • Need for vibration testing?
      • What's that noise?
      • Why RF Shielding?
      • Wires/tubes thru wall?
  • ...Safety and Screening
    • Overview >
      • ACR Safety Zones?
      • MR safety screening?
      • Incomplete screening?
      • Passive v active implants?
      • Conditional implants?
      • Common safety issues?
      • Projectiles?
      • Metal detectors?
      • Pregnant patients?
      • Postop, ER & ICU patients?
      • Temperature monitoring?
      • Orbital foreign bodies?
      • Bullets and shrapnel?
    • Static Fields >
      • "Dangerous" metals?
      • "Safe" metals?
      • Magnetizing metal?
      • Object shape?
      • Forces on metal?
      • Most dangerous place?
      • Force/torque testing?
      • Static field bioeffects?
      • Dizziness/Vertigo?
      • Flickering lights?
      • Metallic taste?
    • RF Fields >
      • RF safety overview?
      • RF biological effects?
      • What is SAR?
      • SAR limits?
      • Operating modes?
      • How to reduce SAR?
      • RF burns?
      • Estimate implant heating?
      • SED vs SAR?
      • B1+rms vs SAR?
      • Personnel exposure?
      • Cell phones?
    • Gradient Fields >
      • Gradient safety overview
      • Acoustic noise?
      • Nerve stimulation?
      • Gradient vs RF heating?
    • Safety: Neurological >
      • Aneurysm coils/clips?
      • Shunts/drains?
      • Pressure monitors/bolts?
      • Deep brain stimulators?
      • Spinal cord stimulators?
      • Vagal nerve stimulators?
      • Cranial electrodes?
      • Carotid clamps?
      • Peripheral stimulators?
      • Epidural catheters?
    • Safety: Head & Neck >
      • Additional orbit safety?
      • Cochlear Implants?
      • Bone conduction implants?
      • Other ear implants?
      • Dental/facial implants?
      • ET tubes & airways?
    • Safety: Chest & Vascular >
      • Breast tissue expanders?
      • Breast biopsy markers?
      • Airway stents/valves/coils?
      • Respiratory stimulators?
      • Ports/vascular access?
      • Swan-Ganz catheters?
      • IVC filters?
      • Implanted infusion pumps?
      • Insulin pumps & CGMs?
      • Vascular stents/grafts?
      • Sternal wires/implants?
    • Safety: Cardiac >
      • Pacemaker dangers?
      • Pacemaker terminology?
      • New/'Safe" Pacemakers?
      • Old/Legacy Pacemakers?
      • Violating the conditions?
      • Epicardial pacers/leads?
      • Cardiac monitors?
      • Heart valves?
      • Miscellaneous CV devices?
    • Safety: Abdominal >
      • PIllCam and capsules?
      • Gastric pacemakers?
      • Other GI devices?
      • Contraceptive devices?
      • Foley catheters?
      • Incontinence devices?
      • Penile Implants?
      • Sacral nerve stimulators?
      • GU stents and other?
    • Safety: Orthopedic >
      • Orthopedic hardware?
      • External fixators?
      • Traction and halos?
      • Bone stimulators?
      • Magnetic rods?
  • …The NMR Phenomenon
    • Spin >
      • What is spin?
      • Why I = ½, 1, etc?
      • Proton = nucleus = spin?
      • Predict nuclear spin (I)?
      • Magnetic dipole moment?
      • Gyromagnetic ratio (γ)?
      • "Spin" vs "Spin state"?
      • Energy splitting?
      • Fall to lowest state?
      • Quantum "reality"?
    • Precession >
      • Why precession?
      • Who was Larmor?
      • Energy for precession?
      • Chemical shift?
      • Net magnetization (M)?
      • Does M instantly appear?
      • Does M also precess?
      • Does precession = NMR?
    • Resonance >
      • MR vs MRI vs NMR?
      • Who discovered NMR?
      • How does B1 tip M?
      • Why at Larmor frequency?
      • What is flip angle?
      • Spins precess after 180°?
      • Phase coherence?
      • Release of RF energy?
      • Rotating frame?
      • Off-resonance?
      • Adiabatic excitation?
      • Adiabatic pulses?
    • Relaxation - Physics >
      • Bloch equations?
      • What is T1?
      • What is T2?
      • Relaxation rate vs time?
      • Why is T1 > T2?
      • T2 vs T2*?
      • Causes of Relaxation?
      • Dipole-dipole interactions?
      • Chemical Exchange?
      • Spin-Spin interactions?
      • Macromolecule effects?
      • Which H's produce signal?
      • "Invisible" protons?
      • Magnetization Transfer?
      • Bo effect on T1 & T2?
      • How to predict T1 & T2?
    • Relaxation - Clincial >
      • T1 bright? - fat
      • T1 bright? - other oils
      • T1 bright? - cholesterol
      • T1 bright? - calcifications
      • T1 bright? - meconium
      • T1 bright? - melanin
      • T1 bright? - protein/mucin
      • T1 bright? - myelin
      • Magic angle?
      • MT Imaging/Contrast?
  • …Pulse Sequences
    • MR Signals >
      • Origin of MR signal?
      • Free Induction Decay?
      • Gradient echo?
      • TR and TE?
      • Spin echo?
      • 90°-90° Hahn Echo?
      • Stimulated echoes?
      • STEs for imaging?
      • 4 or more RF-pulses?
      • Partial flip angles?
      • How is signal higher?
      • Optimal flip angle?
    • Spin Echo >
      • SE vs Multi-SE vs FSE?
      • Image contrast: TR/TE?
      • Opposite effects ↑T1 ↑T2?
      • Meaning of weighting?
      • Does SE correct for T2?
      • Effect of 180° on Mz?
      • Direction of 180° pulse?
    • Inversion Recovery >
      • What is IR?
      • Why use IR?
      • Phase-sensitive IR?
      • Why not PSIR always?
      • Choice of IR parameters?
      • TI to null a tissue?
      • STIR?
      • T1-FLAIR
      • T2-FLAIR?
      • IR-prepped sequences?
      • Double IR?
    • Gradient Echo >
      • GRE vs SE?
      • Multi-echo GRE?
      • Types of GRE sequences?
      • Commercial Acronyms?
      • Spoiling - what and how?
      • Spoiled-GRE parameters?
      • Spoiled for T1W only?
      • What is SSFP?
      • GRASS/FISP: how?
      • GRASS/FISP: parameters?
      • GRASS vs MPGR?
      • PSIF vs FISP?
      • True FISP/FIESTA?
      • FIESTA v FIESTA-C?
      • DESS?
      • MERGE/MEDIC?
      • GRASE?
      • MP-RAGE v MR2RAGE?
    • Susceptibility Imaging >
      • What is susceptibility (χ)?
      • What's wrong with GRE?
      • Making an SW image?
      • Phase of blood v Ca++?
      • Quantitative susceptibility?
    • Diffusion: Basic >
      • What is diffusion?
      • Iso-/Anisotropic diffusion?
      • "Apparent" diffusion?
      • Making a DW image?
      • What is the b-value?
      • b0 vs b50?
      • Trace vs ADC map?
      • Light/dark reversal?
      • T2 "shine through"?
      • Exponential ADC?
      • T2 "black-out"?
      • DWI bright causes?
    • Diffusion: Advanced >
      • Diffusion Tensor?
      • DTI (tensor imaging)?
      • Whole body DWI?
      • Readout-segmented DWI?
      • Small FOV DWI?
      • IVIM?
      • Diffusion Kurtosis?
    • Fat-Water Imaging >
      • Fat & Water properties?
      • F-W chemical shift?
      • In-phase/out-of-phase?
      • Best method?
      • Dixon method?
      • "Fat-sat" pulses?
      • Water excitation?
      • STIR?
      • SPIR?
      • SPAIR v SPIR?
      • SPIR/SPAIR v STIR?
  • …Making an Image
    • From Signals to Images >
      • Phase v frequency?
      • Angular frequency (ω)?
      • Signal squiggles?
      • Real v Imaginary?
      • Fourier Transform (FT)?
      • What are 2D- & 3D-FTs?
      • Who invented MRI?
      • How to locate signals?
    • Frequency Encoding >
      • Frequency encoding?
      • Receiver bandwidth?
      • Narrow bandwidth?
      • Slice-selective excitation?
      • SS gradient lobes?
      • Cross-talk?
      • Frequency encode all?
      • Mixing of slices?
      • Two slices at once?
      • Simultaneous Multi-Slice?
    • Phase Encoding >
      • Phase-encoding gradient?
      • Single PE step?
      • What is phase-encoding?
      • PE and FE together?
      • 2DFT reconstruction?
      • Choosing PE/FE direction?
    • Performing an MR Scan >
      • What are the steps?
      • Automatic prescan?
      • Routine shimming?
      • Coil tuning/matching?
      • Center frequency?
      • Transmitter gain?
      • Receiver gain?
      • Dummy cycles?
      • Where's my data?
      • MR Tech qualifications?
    • Image Quality Control >
      • Who regulates MRI?
      • Who accredits?
      • Mandatory accreditation?
      • Routine quality control?
      • MR phantoms?
      • Geometric accuracy?
      • Image uniformity?
      • Slice parameters?
      • Image resolution?
      • Signal-to-noise?
      • Ghosting?
  • …K-space & Rapid Imaging
    • K-space (Basic) >
      • What is k-space?
      • Parts of k-space?
      • What does "k" stand for?
      • Spatial frequencies?
      • Locations in k-space?
      • Data for k-space?
      • Why signal ↔ k-space?
      • Spin-warp imaging?
      • Big spot in middle?
      • K-space trajectories?
      • Radial sampling?
    • K-space (Advanced) >
      • K-space grid?
      • Negative frequencies?
      • Field-of-view (FOV)
      • Rectangular FOV?
      • Partial Fourier?
      • Phase symmetry?
      • Read symmetry?
      • Why not use both?
      • ZIP?
    • Rapid Imaging (FSE &EPI) >
      • What is FSE/TSE?
      • FSE parameters?
      • Bright Fat?
      • Other FSE differences?
      • Dual-echo FSE?
      • Driven equilibrium?
      • Reduced flip angle FSE?
      • Hyperechoes?
      • SPACE/CUBE/VISTA?
      • Echo-planar imaging?
      • HASTE/SS-FSE?
    • Parallel Imaging (PI) >
      • What is PI?
      • How is PI different?
      • PI coils and sequences?
      • Why and when to use?
      • Two types of PI?
      • SENSE/ASSET?
      • GRAPPA/ARC?
      • CAIPIRINHA?
      • Compressed sensing?
      • Noise in PI?
      • Artifacts in PI?
  • …Contrast Agents
    • Contrast Agents: Physics >
      • Why Gadolinium?
      • Paramagnetic relaxation?
      • What is relaxivity?
      • Why does Gd shorten T1?
      • Does Gd affect T2?
      • Gd & field strength?
      • Best T1-pulse sequence?
      • Triple dose and MT?
      • Dynamic CE imaging?
      • Gadolinium on CT?
    • Contrast Agents: Clinical >
      • So many Gd agents!
      • Important properties?
      • Ionic v non-ionic?
      • Intra-articular/thecal Gd?
      • Gd liver agents (Eovist)?
      • Mn agents (Teslascan)?
      • Feridex & Liver Agents?
      • Lymph node agents?
      • Ferumoxytol?
      • Blood pool (Ablavar)?
      • Bowel contrast agents?
    • Contrast Agents: Safety >
      • Gadolinium safety?
      • Allergic reactions?
      • Renal toxicity?
      • What is NSF?
      • NSF by agent?
      • Informed consent for Gd?
      • Gd protocol?
      • Is Gd safe in infants?
      • Reduced dose in infants?
      • Gd in breast milk?
      • Gd in pregnancy?
      • Gd accumulation?
      • Gd deposition disease?
  • …Cardiovascular and MRA
    • Flow effects in MRI >
      • Defining flow?
      • Expected velocities?
      • Laminar v turbulent?
      • Predicting MR of flow?
      • Time-of-flight effects?
      • Spin phase effects?
      • Flow void?
      • Why GRE ↑ flow signal?
      • Slow flow v thrombus?
      • Even-echo rephasing?
      • Flow-compensation?
      • Flow misregistration?
    • MR Angiography - I >
      • MRA methods?
      • Dark vs bright blood?
      • Time-of-Flight (TOF) MRA?
      • 2D vs 3D MRA?
      • MRA parameters?
      • Magnetization Transfer?
      • Ramped flip angle?
      • MOTSA?
      • Fat-suppressed MRA?
      • TOF MRA Artifacts?
      • Phase-contrast MRA?
      • What is VENC?
      • Measuring flow?
      • 4D Flow Imaging?
      • How accurate?
    • MR Angiography - II >
      • Gated 3D FSE MRA?
      • 3D FSE MRA parameters?
      • SSFP MRA?
      • Inflow-enhanced SSFP?
      • MRA with ASL?
      • Other MRA methods?
      • Contrast-enhanced MRA?
      • Timing the bolus?
      • View ordering in MRA?
      • Bolus chasing?
      • TRICKS or TWIST?
      • CE-MRA artifacts?
    • Cardiac I - Intro/Anatomy >
      • Cardiac protocols?
      • Patient prep?
      • EKG problems?
      • Magnet changes EKG?
      • Gating v triggering?
      • Gating parameters?
      • Heart navigators?
      • Dark blood/Double IR?
      • Why not single IR?
      • Triple IR?
      • Polar plots?
      • Coronary artery MRA?
    • Cardiac II - Function >
      • Beating heart movies?
      • Cine parameters?
      • Real-time cine?
      • Ventricular function?
      • Tagging/SPAMM?
      • Perfusion: why and how?
      • 1st pass perfusion?
      • Quantifying perfusion?
      • Dark rim artifact
    • Cardiac III - Viability >
      • Gd enhancement?
      • TI to null myocardium?
      • PS (phase-sensitive) IR?
      • Wideband LGE?
      • T1 mapping?
      • Iron/T2*-mapping?
      • Edema/T2-mapping?
      • Why/how stress test?
      • Stess drugs/agents?
      • Stress consent form?
  • …MR Artifacts
    • Tissue-related artifacts >
      • Chemical shift artifact?
      • Chemical shift in phase?
      • Reducing chemical shift?
      • Chemical Shift 2nd Kind?
      • In-phase/out-of phase?
      • IR bounce point?
      • Susceptibility artifact?
      • Metal suppression?
      • Dielectric effect?
      • Dielectric Pads?
    • Motion-related artifacts >
      • Why discrete ghosts?
      • Motion artifact direction?
      • Reducing motion artifacts?
      • Saturation pulses?
      • Gating methods?
      • Respiratory comp?
      • Navigator echoes?
      • PROPELLER/BLADE?
    • Technique-related artifacts >
      • Partial volume effects?
      • Slice overlap?
      • Aliasing?
      • Wrap-around artifact?
      • Eliminate wrap-around?
      • Phase oversampling?
      • Frequency wrap-around?
      • Spiral/radial artifacts?
      • Gibbs artifact?
      • Nyquist (N/2) ghosts?
      • Zipper artifact?
      • Data artifacts?
      • Surface coil flare?
      • MRA Artifacts (TOF)?
      • MRA artifacts (CE)?
  • …Functional Imaging
    • Perfusion I: Intro & DSC >
      • Measuring perfusion?
      • Meaning of CBF, MTT etc?
      • DSC v DCE v ASL?
      • How to perform DSC?
      • Bolus Gd effect?
      • T1 effects on DSC?
      • DSC recirculation?
      • DSC curve analysis?
      • DSC signal v [Gd]
      • Arterial input (AIF)?
      • Quantitative DSC?
    • Perfusion II: DCE >
      • What is DCE?
      • How is DCE performed?
      • How is DCE analyzed?
      • Breast DCE?
      • DCE signal v [Gd]
      • DCE tissue parmeters?
      • Parameters to images?
      • K-trans = permeability?
      • Utility of DCE?
    • Perfusion III: ASL >
      • What is ASL?
      • ASL methods overview?
      • CASL?
      • PASL?
      • pCASL?
      • ASL parameters?
      • ASL artifacts?
      • Gadolinium and ASL?
      • Vascular color maps?
      • Quantifying flow?
    • Functional MRI/BOLD - I >
      • Who invented fMRI?
      • How does fMRI work?
      • BOLD contrast?
      • Why does BOLD ↑ signal?
      • Does BOLD=brain activity?
      • BOLD pulse sequences?
      • fMRI Paradigm design?
      • Why "on-off" comparison?
      • Motor paradigms?
      • Visual?
      • Language?
    • Functional MRI/BOLD - II >
      • Process/analyze fMRI?
      • Best fMRI software?
      • Data pre-processing?
      • Registration/normalization?
      • fMRI statistical analysis?
      • General Linear Model?
      • Activation "blobs"?
      • False activation?
      • Resting state fMRI?
      • Analyze RS-fMRI?
      • Network/Graphs?
      • fMRI at 7T?
      • Mind reading/Lie detector?
      • fMRI critique?
  • …MR Spectroscopy
    • MRS I - Basics >
      • MRI vs MRS?
      • Spectra vs images?
      • Chemical shift (δ)?
      • Measuring δ?
      • Backward δ scale?
      • Predicting δ?
      • Size/shapes of peaks?
      • Splitting of peaks?
      • Localization methods?
      • Single v multi-voxel?
      • PRESS?
      • STEAM?
      • ISIS?
      • CSI?
    • MRS II - Clinical ¹H MRS >
      • How-to: brain MRS?
      • Water suppression?
      • Fat suppression?
      • Normal brain spectra?
      • Choice of TR/TE/etc?
      • Hunter's angle?
      • Lactate inversion?
      • Metabolite mapping?
      • Metabolite quantitation?
      • Breast MRS?
      • Gd effect on MRS?
      • How-to: prostate MRS?
      • Prostate spectra?
      • Muscle ¹H-MRS?
      • Liver ¹H-MRS?
      • MRS artifacts?
    • MRS III - Multi-nuclear >
      • Other nuclei?
      • Why phosphorus?
      • How-to: ³¹P MRS
      • Normal ³¹P spectra?
      • Organ differences?
      • ³¹P measurements?
      • Decoupling?
      • NOE?
      • Carbon MRS?
      • Sodium imaging?
      • Xenon imaging?
  • ...Artificial Intelligence
    • AI Part I: Basics >
      • Artificial Intelligence (AI)?
      • What is a neural network?
      • Machine Learning (ML)?
      • Shallow v Deep ML?
      • Shallow networks?
      • Deep network types?
      • Data prep and fitting?
      • Back-Propagation?
      • DL 'Playground'?
    • AI Part 2: Advanced >
      • What is convolution?
      • Convolutional Network?
      • Softmax?
      • Upsampling?
      • Limitations/Problems of AI?
      • Is the Singularity near?
    • AI Part 3: Image processing >
      • AI in clinical MRI?
      • Super-resolution?
  • ...Tissue Properties Imaging
    • MRI of Hemorrhage >
      • Hematoma overview?
      • Types of Hemoglobin?
      • Hyperacute/Oxy-Hb?
      • Acute/Deoxy-Hb?
      • Subacute/Met-Hb?
      • Deoxy-Hb v Met-Hb?
      • Extracellular met-Hb?
      • Chronic hematomas?
      • Hemichromes?
      • Ferritin/Hemosiderin?
      • Subarachnoid blood?
      • Blood at lower fields?
    • T2 cartilage mapping
    • MR Elastography?
    • Synthetic MRI?
    • Amide Proton Transfer?
    • MR thermography?
    • Electric Properties Imaging?
  • Copyright/Legal
    • Copyright Issues
    • Legal Disclaimers
  • Forums/Blogs/Links
  • What's New
  • Self-test Quizzes - NEW!
    • Magnets & Scanners Quiz
    • Safety & Screening Quiz
    • NMR Phenomenon Quiz
    • Pulse Sequences Quiz
    • Making an Image Quiz
    • K-space & Rapid Quiz
    • Contrast & Blood Quiz
    • Cardiovascular & MRA Quiz

Types of Deep Neural Networks

What are the various types of deep networks and how are they used?  
Types of Neural networks
As you might imagine, multiple configurations of artificial neurons are possible. Some of the more important neural network variations are briefly cataloged below. The first type, Convolutional Neural Network (CNN), will only be briefly introduced here, but is so important for image-based applications that it will be covered more extensively in several subsequent Q&As. 
Convolutional Neural Networks (CNNs)
CNN is the configuration most widely used for MRI and other image processing applications. Data is typically input as 2D or 3D arrays of pixels and initial computational phases are restricted to operations on neighboring pixels.  
Convolution involves a sliding multiplication and addition of regional data points by a small (~5x5) array of numbers called a kernel or filter. By using sets of different kernels (such as ones that emphasize edges or shapes) a set of regional feature maps is produced. The next step, called subsampling or pooling, shrinks the size of each feature map, typically taking the maximum value in each 2x2 pixel group. The convolution-pooling process may be repeated several times, producing ever smaller "feature of feature" maps. In the final stage, known as vectorization or flattening, all the resultant 2-dimensional arrays are converted into a single long chain of numbers (i.e., a "vector"). The vectorized feature maps then serve as inputs into a fully connected neural network whose output may then be fed back in training to adjust weighting parameters in the convolutional part of the network or used to produce a final result (such as tissue segmentation or classification).
Picture
Building blocks of a CNN
Two basic two types of CNN architectures can be distinguished based on the connection modes between convolutional layers. The first type of CNN architecture (pictured above) connects different convolutional layers in series. Examples include such well-known configurations as AlexNet, VGGNet, and ZFNet. The second architecture connects convolutions in parallel, characterized by the inception modules used by GoogleNet and its progeny.  An inception module consists of multiple parallel convolutional layers, each with a different filter size. The intuition behind this design is that different filter sizes are better suited for detecting features of different scales, which is useful for recognizing objects of various sizes and shapes.   
Encoder-Decoder Networks
​

Encoder-decoder networks are a specific form of CNN widely used for image segmentation, co-registration, and artifact reduction. They typically have a "U-configuration", with an initial contracting path (the encoder) followed by an expansion path (the decoder). Full dimensional images input into the encoder portion are reduced in size via a CNN-type mechanism into lower dimensional data, thus compressing the feature maps. These maps are then re-expanded in the decoder portion back to full size containing the most meaningful features. Loss of spatial resolution can be overcome by inserting skip connections between the sides to pass through important details to the output image. The U-net network (pictured below) and its variants are the most widely used encoder-decoder networks for medical imaging.  
Picture
Basic U-net architecture.
Generative Adversarial Networks (GANs)
GANs consist of two competing components: (1) the Generator, a deconvolutional network that uses random noise and interpolation to generate "fake" but realistic-looking images, and (2) the Discriminator, a conventional CNN previously trained with supervised learning to identify real images at a certain level of accuracy. 
In operation, images are randomly sent to the discriminator, which must then decide which are real and which have been generated.  ​If its judgment is wrong (i.e. a fake image being mistaken for real), the Discriminator will learn from its error and improve on subsequent runs. If its judgment is correct, this information will be sent back to the Generator to help it develop better "fake" images. Through iteration, the performance of both components improve, as does the GAN as a whole.  Applications in MRI include image synthesis, reconstruction, segmentation, co-registration, and production of "super-resolution" images.
Generative Adversarial Network (GAN)
Structure of a Generative Adversarial Network (GAN), where the Generator tries to fool the Discriminator between identifying "real" and "fake" images, with both learning iteratively through the process.
PictureNodes in an RNN can feed recent outputs back
onto themselves or others
Recurrent Neural Networks (RNNs)

​RNNs differ from standard "feed forward"  neural networks in that they contain data feedback loops. This feedback serves as a type of "memory" allowing them to use recent outputs as updated inputs for subsequent calculations. RNNs are useful in the analysis of sequentially acquired data, including time series and speech processing. For example, the location of the cardiac septum on cine MRI depends on its immediately prior position; the next word in a radiology report depends on the previous word. 

Simple feedback loops can only allow a handful of data time steps to be retained and used. To overcome this limitation more complex RNN configurations, including Long Short-Term Memory (LSTM) and Gated Recurrent Units (GRUs), have been developed that allow retention of data over many thousands of time points. Bi-directional RNNs are also possible, where the output of later nodes is fed back to update past nodes in an iterative fashion.  In recent years, Transformer Neural Networks (TNNs) discussed below have largely replaced RNNs and LSTMs for many applications.
Transformer Neural Networks (TNNs)
TNNs, also known simply as Transformers, were developed in 2017 by the Google Brain research group to improve natural language processing.  The wildly popular application, ChatGPT, is based on a transformer architecture. (The letters GPT stand for "Generative Pre-trained Transformer".)  Many transformer variations exist, including hybrid forms with CNNs or GANs.
Transformer networks are different from traditional recurrent neural networks (RNNs) and convolutional neural networks (CNNs) in that they do not use sequential processing or convolutional filters. Instead, they use self-attention mechanisms and parallel processing to handle input sequences.
PictureTypical Transformer architecture with encoder on left and decoder on right.
(From He et al under CC-BY)
The typical Transformer architecture utilizes an Encoder-Decoder arrangement as shown in the diagram right. Transformers are designed to process data in parallel rather than sequentially, so that information is simultaneously passed between the Encoder and Decoder segments. This parallel architecture means that graphics processing units (GPUs) can be used more efficiently increasing the speed of calculations and training.
Two other characteristic features of Transformer Networks are the use of positional encoding and self-attention.  Positional encoding refers to the indexing of words or image segments by original location as they pass through the network. Self-attention refers to the ability of the network to identify the most important features in a data set and their relationships to other data.

A complete description of Transformers is well beyond the scope of this web site, but I suspect you will here more and more about these networks for image-based applications in the future.  Due to the large number of pixels, images are usually broken up into multiple small "patches" (say, 16x16 in size) whose relative locations (positional encodings) are remembered by the system and key features identified (by self-attention mechanisms).  Some recent examples of the use of Transformers in medical imaging applications (including segmentation, classification, and processing) are provided in the references below.

Advanced Discussion (show/hide)»

More information about Transformer Networks in Imaging

While transformer networks were initially introduced for natural language processing, they have also been adapted for image processing tasks. The adaptation of transformer networks for image processing is referred to as vision transformers (ViTs).

The key idea behind ViT is to break down an image into patches and treat them as sequences of input tokens similar to the words in a sentence. Each patch is represented by a feature vector obtained through a convolutional neural network (CNN). These feature vectors are then linearly projected to the same dimension as the word embeddings in the original transformer network. The resulting patch embeddings are treated as input tokens for the transformer network.

The transformer network is then applied to the sequence of patch embeddings, similar to the way it is applied to the sequence of word embeddings in natural language processing. The self-attention mechanism is used to model the dependencies between patches, allowing the network to capture global context information from the entire image. A multi-head self-attention mechanism is also used to allow the network to attend to different parts of the image.

After the self-attention mechanism, the patch embeddings are passed through a feedforward neural network to produce the final output. The output can be used for a variety of image processing tasks, such as image classification or object detection.

ViTs have shown promising results in image processing tasks and has achieved state-of-the-art performance on several benchmarks, particularly in scenarios where the available training data is limited. However, ViT requires more memory and computational resources than traditional CNN-based models, and it may not perform as well on tasks that require fine-grained spatial information, such as segmentation tasks.


References
     
Ali A. Convolutional neural network (CNN) with practical implementation. Wavy AI Research Foundation. Downloaded from https://medium.com/machine-learning-researcher 20 Jan 2022
     Ali A. Recurrent neural network and LSTM with practical implementation. Wavy AI Research Foundation. Downloaded from https://medium.com/machine-learning-researcher 20 Jan 2022.
    Chartrand G, Cheng PM, Vorontsov E, et al. Deep learning: a primer for radiologists. RadioGraphics 2017; 37:2113-2131.  [DOI LINK]
     Cheng PM, Montagnon E, Yamashita R, et al. Deep learning: an update for radiologists. RadioGraphics 2021; 41:1427-1445.  [DOI LINK]
     Dosovitskiy L, Beyer L, Kolesnikov A, et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. ArXiv abs/2010.11929 (2021)
     
Giacaglia G. How transformers work. Towards Data Science. 10 Mar 2019
     
He K, Gan C, Li Z, et al. Transformers in medical image analysis. Intelligent Medicine 2023; 3:59-78.  [DOI LINK]
     Lundervold AS, Lundervold A. An overview of deep learning in medical imaging focusing on MRI. Z Med Phys 2019; 29:102-127. [DOI LINK]
     Maier A, Syben C, Lasser T, Riess C. A gentle introduction to deep learning in medical image processing. Z Med Pay 2019; 29:86-101. [DOI LINK]
     Moawad AW, Fuentes DT, ElBanan MG, et al. Artificial intelligence in diagnostic radiology: where do we stand, challenges, and opportunities. J Comput Assist Tomogr 2022; 46:78-90.  [DOI LINK] 
    Nicholson C.  A beginner’s guide to LSTMs and recurrent neural networks. In: A.I. Wiki. A beginner’s guide to important topics in AI, machine learning, and deep learning. (Downloaded from  wiki.pathmind.com 13 Jan 2022)
     
Siddique N, Paheding S, Elkin CP, Devabhaktuni V. U-Net and its variants for medical image segmentation: a review of theory and applications. IEEE Access 2021:82031-82057.  [DOI LINK]
     
Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need. (2017-6-12). arXiv1706.03762.    [DOI LINK] (Original paper describing Transformers)
     Willemink MJ, Roth HR, Sandfort V. Two foundational deep learning models for medical imaging in the new era of Transformer Networks. Radiology: Artificial Intelligence 2022; 4(6):e210284.  [DOI LINK]
     Wolterink JM, Mukhopadhyay A, Leiner T, et al. Generative adversarial networks: a primer for radiologists. RadioGraphics 2021; 41:840-857.  [DOI LINK]
     
Yi Xin, Babyn PS. Generative adversarial network in medical imaging: a review. Medical Image Analysis 2019; 58:101552.  [DOI LINK]

Related Questions
     Is artificial intelligence the same as machine learning?
​
     I still don't understand how machines learn. How do they "reprogram" themselves?

←  Previous Question
Next Question  →
↑ Complete List of Questions ↑
© 2024 AD Elster, ELSTER LLC
All rights reserved.   
MRIquestions.com - Home
Donate
Please help keep this site free for everyone in the world!