Chinese Journal of Bases and Clinics in General Surgery (中国普外基础与临床杂志)

Database Construction Part I: Tags and Structuring of Personal Data

Objective: To explain in detail how the personal data in the Database from Colorectal Cancer (DACCA) at West China Hospital were constructed, together with the corresponding data tags and structuring.

Methods: A descriptive, text-based approach was used.

Results: The 23 items of personal data in the West China DACCA, grouped into 18 classification categories, were defined and their underlying concepts specified. For each item, the way its data tags are designed and the structuring required at the big-data application stage were described. Error-correction precautions were described for all classification categories, and the privacy protection applied to the 3 categories that involve private information was explained.

Conclusions: This detailed description of how personal data are constructed in the West China DACCA provides a standard and a basis for its future clinical application, and offers a reference for colleagues who wish to build their own colorectal cancer databases.

Key words: colorectal cancer; database; big data; personal data; tag; structuring

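Cite this article: Wang Xiaodong, Li Li. Database construction part I: tags and structuring of personal data. Chinese Journal of Bases and Clinics in General Surgery, 2019, 26(3): 335-342. doi: 10.7507/1007-9424.201901070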
