Datasets

12M Records

ALL

Demographic

Demographic data including gender and year of birth.


44M Records

From 2000

Accident and Emergency Department Attendance

AE attendance data including attendance date (up to month), calculated age on admission, triage category and discharge information.


84M Records

From 1997

Inpatient Admission, Transfer and Discharge

Inpatient episode transaction data including admission and discharge date (up to month), calculated age of admission, admission source, admission specialty and discharge information.


523M Records

From 2000

Outpatient Appointment

Outpatient appointment and attendance data including appointment date (up to month), calculated age on date of appointment, attended specialty and appointment type indicating first attendant or follow-up attendant.


102M Records

ALL

Diagnosis

Diagnosis progress data including patient’s diagnosis, diagnosis status and diagnosis date (up to month).


39M Records

ALL

Procedure

Procedure progress data including procedure and date of procedure (up to month).


1,170M Records

From 2000

Medication

Dispensed prescription data including dispended drug item with corresponding British National Formulary (BNF) code, prescription period and prescribed dosage.


6M Records

From 01 Oct 2009

Immunization

Immunization data of Hospital Authority including injection date (up to month) and vaccine injected.


438M Records

ALL

Family Medicine

Patient disease data including date of the patient disease result (up to month) with ICPC2 code.


743K Records

From 2002

Obstetrics

Obstetrics data including date of baby delivery (up to year), weight of the baby at birth, parity and maturity.


2,538M Records

From 2000

Laboratory Tests and Results

Laboratory result data including chemical pathology, hematology & immunology and microbiology & virology with reference date of the laboratory result (up to month).


131M Records

From 1 Apr 1999

Radiology Examinations

Radiology examination result including registration date of the radiology examination (up to month), examination details and calculated age of the patient on the day of radiology appointment.


130M Records

From 3 Feb 1994

Clinical Note/Summary

Clinical and discharge note including record creation date (up to month) and text content of the clinical note.


23M Records

From 1 Apr 1999

Radiology Report

Text content of the examination report.


(project based)

Radiology Image

Radiology image (DICOM) on project basis.


2,464M Records

From 2003

Generic Clinical Documentation (GCD) Form 

GCD Form at the initial phase includes Stroke Proforma data.

Data Products

Structured data of 14 chronic diseases:

328K Patients

1993-2023

Chronic Heart Failure (CHF)

A progressive condition where the heart’s ability to pump blood efficiently is impaired, leading to inadequate circulation. Symptoms include shortness of breath, fatigue, exercise intolerance, and edema (swelling in the feet, ankles, or abdomen). Common causes include coronary artery disease, history of myocardial infarction, or hypertension. Predominantly affects individuals aged 65+.


739K Patients

1998-2023

Chronic Kidney Disease (CKD) Stage 3A, 3B, 4 & 5

Defined by glomerular filtration rate (GFR) <60 mL/min/1.73m² or structural/functional kidney abnormalities persisting ≥3 months. Diagnostic markers include albuminuria, urinary sediment irregularities, or tubular electrolyte disorders.

Stage 3A: GFR 45–59 mL/min/1.63m²

Stage 3B: GFR 30–44 mL/min/1.73m²

Stage 4: GFR 15–29 mL/min/1.73m²

Stage 5 (Kidney Failure): GFR <15 mL/min/1.73m²


186K Patients

1992-2023

Chronic Obstructive Pulmonary Disease (COPD)

A preventable, progressive respiratory disorder characterized by persistent airflow limitation due to chronic inflammation of airways and lung tissue, typically triggered by exposure to harmful particles/gases (e.g., smoking). Exacerbations and comorbidities (e.g., cardiovascular disease) worsen disease severity.


561K Patients

1993-2023

Coronary Heart Disease (CHD)

Caused by atherosclerotic narrowing/blockage of coronary arteries, reducing blood flow to the heart. Symptoms include angina, dyspnea on exertion, and myocardial infarction. Risk factors: age, smoking, hypertension, hyperlipidemia, diabetes, obesity, and family history of early-onset CHD. Also termed coronary artery disease (CAD).


277K Patients

1993-2023

Dementia

A syndrome marked by progressive cognitive decline, including memory loss, impaired reasoning, personality changes, and difficulty performing daily tasks. Common etiologies include Alzheimer’s disease, cerebrovascular injury, or neurodegenerative disorders. Symptoms worsen over time.


856K Patients

2007-2023

Diabetes Mellitus (DM)

A metabolic disorder characterized by chronic hyperglycemia due to defects in insulin secretion, insulin action, or both. Disrupts carbohydrate, lipid, and protein metabolism. Subtypes include Type 1 (autoimmune) and Type 2 (insulin resistance).


255K Patients

1993-2023

Glaucoma

A group of eye disorders involving increased intraocular pressure, damaging the optic nerve and leading to vision loss. Caused by impaired drainage of aqueous humor, which compresses the retina and optic nerve.


310K Patients

1995-2023

Hepatitis B Carriers

Individuals with persistent hepatitis B virus (HBV) infection, defined as HBsAg-positive or HBV DNA-positive for >6 months. Carriers face elevated risks of cirrhosis and hepatocellular carcinoma (HCC), independent of liver function status.


162K Patients

1992-2023

Hip Fracture (as approximate for Osteoporosis)

A fracture in the proximal femur (upper thigh bone), often caused by trauma or osteoporosis. Common in older adults and associated with reduced mobility.


1293K Patients

1993-2023

Hyperlipidemia (HLD)

A group of metabolic disorders marked by elevated blood lipoprotein levels, including high cholesterol (LDL) and/or triglycerides. Major risk factor for atherosclerosis and cardiovascular disease.


2109K Patients

2007-2023

Hypertension (HT)

Primary hypertension (95% of cases) refers to persistently elevated blood pressure without identifiable cause. Risk factors include age, obesity, salt sensitivity, and genetic predisposition.


76K Patients

1993-2023

Parkinsonism

A clinical syndrome defined by bradykinesia plus ≥1 of the following: resting tremor, rigidity, or postural instability. Causes include Parkinson’s disease, drug-induced dopamine blockade, vascular lesions, or neurodegenerative disorders.


220K Patients

2001-2023

Stroke

Sudden cerebral ischemia (loss of blood flow to the brain) due to thrombosis, embolism, or hemorrhage. Symptoms include hemiparesis, aphasia, and altered consciousness. Risk factors: hypertension, atrial fibrillation, diabetes, smoking, and age. Also termed cerebrovascular accident (CVA).


207K Patients

1993-2023

Depression

A mood disorder characterized by persistent sadness, anhedonia, fatigue, sleep disturbances, and suicidal ideation. Affects 15–25% of cancer patients. Treatable with therapy and/or pharmacotherapy.


 

 

Structured data of 11 cancers (Prevalence provided by Hong Kong Cancer Registry):

84K Patients

2006-2023

Colorectal Cancer

Malignancy arising in the colon or rectum, often from adenomatous polyps.


67K Patients

2006-2023

Breast (Female) Cancer

Most commonly ductal or lobular carcinoma, originating in milk ducts or glands. Invasive forms spread beyond the primary site.


88K Patients

2006-2023

Lung Cancer

Classified as small cell (aggressive, linked to smoking) or non-small cell (e.g., adenocarcinoma, squamous cell).


33K Patients

2006-2023

Prostate Cancer

Adenocarcinoma of the prostate gland, primarily affecting older males.


33K Patients

2006-2023

Liver Cancer

Primary (hepatocellular carcinoma) or secondary (metastatic spread from other organs).


14K Patients

2006-2023

Nasopharynx Cancer

Malignancy in the nasopharyngeal epithelium, linked to Epstein-Barr virus in endemic regions.


20K Patients

2006-2023

Stomach Cancer

Adenocarcinoma of the gastric mucosa. Also termed gastric cancer.


15K Patients

2006-2023

Corpus Cancer

Malignant tumors of the uterine corpus (e.g., endometrial carcinoma, leiomyosarcoma).


9K Patients

2006-2023

Ovary Cancer

Includes epithelial carcinomas (most common) and germ cell tumors. Often diagnosed at advanced stages.


8K Patients

2006-2023

Cervix Cancer

Malignancy of the cervix, frequently associated with HPV infection.


14K Patients

2006-2023

Non-Hodgkin Lymphoma

Diverse group of lymphocyte cancers (B-cell or T-cell origin), distinct from Hodgkin lymphoma.