共查询到20条相似文献,搜索用时 66 毫秒
1.
Web日志预处理的Clementine方案 总被引:5,自引:0,他引:5
利用Clementine完成Web日志预处理数据流的初步构建,实现了数据清洗、用户识别、会话识别、路径补充4大过程,同时具备日志合并、数据审核、规范编码、外部信息关联等辅助功能。实验研究表明,利用Clementine对Web日志进行预处理是完全可行的,这为在该平台上进一步完成挖掘工作奠定了基础,从一定程度上解决了Web日志挖掘与预处理交由不同工具处理的困境,提高了Web日志挖掘的自动化程度。 相似文献
2.
3.
4.
5.
6.
目的:了解医务人员网络信息行为特征,以促进医院图书馆改善信息服务,提高服务质量。方法:使用Piwik开源分析工具对医院图书馆网站的Web日志文件进行处理,从网络流量、离站链接数量、页面访问次数、用户信息行为时间偏好、浏览器及访问设备等角度分析医务人员网络信息行为。结果:医务人员的信息行为频繁,主要目的是使用电子资源,信息行为发生的时间主要集中于工作时间。结论:医院图书馆应该加强图书馆网站内容建设,优化网站设计,根据信息行为的时间偏好更新信息,加大图书馆宣传力度,提升服务质量。 相似文献
7.
根据HIS数据库中工作人员字典的定义,分析该字典的E-R数据模型设计存在的不足,对数据库结构进行调整,实现了工作人员字典数据的动态特性. 相似文献
8.
选取具有代表性的13种文献信息分析工具,从支持的数据格式、数据预处理、构建的关系矩阵、标准化处理、分析方法、结果的可视化等方面进行了比较,总结了每个工具的优势与不足,并为用户选择合适的分析工具提出了建议。 相似文献
9.
10.
利用Clementine工具实现用户频繁访问路径的挖掘,包括数据预处理、数据格式转化、挖掘分析等3个过程。基于工具的挖掘,可大大缩减数据预处理和序列挖掘的时间。研究证明,实现用户频繁访问路径的Clementine挖掘是一种行之有效的方法,研究中构建的Clementine数据流可继续完善成为网络日志挖掘的应用模版,适合于更加复杂的行为挖掘。 相似文献
11.
12.
13.
14.
15.
Craig Barnes Binam Bajracharya Matthew Cannalte Zakir Gowani Will Haley Taha Kass-Hout Kyle Hernandez Michael Ingram Hara Prasad Juvvala Gina Kuffel Plamen Martinov J Montgomery Maxwell John McCann Ankit Malhotra Noah Metoki-Shlubsky Chris Meyer Andre Paredes Jawad Qureshi Xenia Ritter Philip Schumm Mingfei Shao Urvi Sheth Trevar Simmons Alexander VanTol Zhenyu Zhang Robert L Grossman 《J Am Med Inform Assoc》2022,29(4):619
ObjectiveThe objective was to develop and operate a cloud-based federated system for managing, analyzing, and sharing patient data for research purposes, while allowing each resource sharing patient data to operate their component based upon their own governance rules. The federated system is called the Biomedical Research Hub (BRH).Materials and MethodsThe BRH is a cloud-based federated system built over a core set of software services called framework services. BRH framework services include authentication and authorization, services for generating and assessing findable, accessible, interoperable, and reusable (FAIR) data, and services for importing and exporting bulk clinical data. The BRH includes data resources providing data operated by different entities and workspaces that can access and analyze data from one or more of the data resources in the BRH.ResultsThe BRH contains multiple data commons that in aggregate provide access to over 6 PB of research data from over 400 000 research participants.Discussion and conclusionWith the growing acceptance of using public cloud computing platforms for biomedical research, and the growing use of opaque persistent digital identifiers for datasets, data objects, and other entities, there is now a foundation for systems that federate data from multiple independently operated data resources that expose FAIR application programming interfaces, each using a separate data model. Applications can be built that access data from one or more of the data resources. 相似文献
16.
17.
18.
19.
20.
Kellie M Walters Anna Jojic Emily R Pfaff Marie Rape Donald C Spencer Nicholas J Shaheen Brent Lamm Timothy S Carey 《J Am Med Inform Assoc》2022,29(4):707
Institutions must decide how to manage the use of clinical data to support research while ensuring appropriate protections are in place. Questions about data use and sharing often go beyond what the Health Insurance Portability and Accountability Act of 1996 (HIPAA) considers. In this article, we describe our institution’s governance model and approach. Common questions we consider include (1) Is a request limited to the minimum data necessary to carry the research forward? (2) What plans are there for sharing data externally?, and (3) What impact will the proposed use of data have on patients and the institution? In 2020, 302 of the 319 requests reviewed were approved. The majority of requests were approved in less than 2 weeks, with few or no stipulations. For the remaining requests, the governance committee works with researchers to find solutions to meet their needs while also addressing our collective goal of protecting patients. 相似文献