Latest News

The 8 best open source tools for data mining

January 26, 2021/in Expert Research Service /by support service

Data mining is also known as data exploration. It is a step in Knowledge-Discovery in Databases, a process of mining and analyzing large amounts of data and extracting information from it. Some of these applications include market segmentation-such as identifying the characteristics of a customer buying a specific product from a specific brand, fraud detection-identifying transaction patterns that may lead to online fraud, etc. In this article, we have compiled the 8 best open source tools for data mining.

1.weka

As an open data mining platform, WEKA has assembled a large number of machine learning algorithms that can undertake data mining tasks, including data preprocessing, classification, regression, clustering, association rules, and visualization on a new interactive interface.

2.Rapid Miner

RapidMiner is the world’s leading data mining solution, with advanced technology to a very large extent. Its data mining tasks cover a wide range, including various data arts, which can simplify the design and evaluation of the data mining process.

3. Orange

Orange is a component-based data mining and machine learning software package. Its functions are friendly, powerful, fast and multi-functional visual programming front end for browsing data analysis and visualization, and it is based on Python for script development. . It contains a complete series of components for data preprocessing, and provides data accounting, transition, modeling, model evaluation and exploration functions. It is developed by C++ and Python, and its graphics library is developed by the cross-platform Qt framework.

4. Knime

KNIME (Konstanz Information Miner) is a user-friendly, intelligent, and open source platform for data integration, data processing, data analysis and data exploration.

5. jHepWork

jHepWork is a complete set of object-oriented scientific data analysis framework. Jython macros are used to display one-dimensional and two-dimensional histogram data. The program includes many tools that can be used to interact with two-dimensional and three-dimensional scientific graphics.

6. Apache Mahout

Apache Mahout is a brand new open source project developed by the Apache Software Foundation (ASF). Its main goal is to create some scalable machine learning algorithms for developers to use for free under the Apache license. The project has reached its second year and currently only has one public release. Mahout contains many implementations, including clustering, classification, CP, and evolutionary programs. In addition, by using the Apache Hadoop library, Mahout can be effectively extended to the cloud.

7. ELKI

ELKI (Environment for Developing KDD-Applications Supported by Index-Structures) is mainly used to cluster and find outliers. ELKI is a data mining platform similar to weka, written in java, with a GUI graphical interface. Can be used to find outliers.

8. Rattle

Rattle (an easy-to-learn R analysis tool) provides statistical and visual summaries of data, converts data into easy-to-model forms, constructs unsupervised and supervised models from the data, presents the performance of the model graphically, and obtains new data set.

Common mistakes in data analysis

January 25, 2021/0 Comments/in Expert Research Service /by support service

The background of big data is that the whole society is going digital, especially the development of social networks and various sensing devices. The development of cloud computing and search engines has made it possible to efficiently analyse big data. The core issue is how to quickly obtain valuable information from a large variety of data. Realizing corporate strategic operations through data analysis has become the norm, so what are the common errors in the data analysis process?

　Common errors in the process of data analysis:

　　1. The analysis goal is not clear

　　”Massive data does not actually produce massive wealth.” Because many data analysts do not have clear analysis goals, they are often confused in the massive data. Either the wrong data is collected, or the collected data is not complete, which will lead to the results of data analysis are not accurate enough.

　　However, if the target is locked at the beginning, what exactly do you want to analyze? If you think about results oriented, you will know what kind of data you need to support your analysis. In order to determine the source of the data, collection methods and analysis indicators.

　　2. Errors occur when collecting data

　　When the software or hardware that captures the data goes wrong, a certain error occurs. For example, if the usage log is not synchronized with the server, the user behavior information on the mobile application may be lost. Likewise, if we use hardware sensors like microphones, our recordings may capture background noise or other electrical signal interference.

　　3. The sample is not representative

　　When performing data analysis, there must be a credible data sample. This is the key to ensuring that the data analysis result is reliable. If the data sample is not representative, the final analysis result will be of no value. Therefore, for data samples, it is also required to be complete and comprehensive. Use single, non-representative data to replace all data for analysis. The analysis results obtained from such one-sided data may be completely wrong.

　　For example, Twitter users may be more educated and have higher incomes, and their age will be somewhat older. If such a biased sample is used to predict the box office of a movie whose target audience is young people, the analysis conclusion may not be reasonable. So make sure that the sample data you get is representative of the research population. Otherwise, your analysis and conclusions lack a solid foundation.

　　4. Correlation and causality confusion

　　Most data analysts assume that correlation directly affects causality when dealing with big data. Using big data to understand the correlation between two variables is usually a good practice method, but always using the “causal” analogy can lead to false predictions and invalid decisions. To achieve good results in data analysis, we must understand the fundamental difference between correlation and causality. Correlation often refers to observing changes in X and Y at the same time, while causality means that X leads to Y. In data analysis, these are two completely different things, but many data analysts often overlook the difference.

　　Correlationship in data science is not causation.If two relationships are related to each other, it does not mean that one caused the other.

　　5. Divorce from business reality

　　A professional data analyst must be very familiar with the industry situation, business process, and related knowledge of the project being analysed, because the result of data analysis is to solve the problems in the project or provide reference opinions for industry decision makers. If the business knowledge and data analysis work cannot be combined well, and the business reality is divorced and only concerned with the data, the analysis results obtained in this case will not have reference value.

　　6. Passionate about advanced analysis

　　Some data analysts will excessively pursue the so-called cutting-edge, advanced and fashionable analysis technology. When facing an analysis project, the first thing they think of is to choose a cutting-edge technology to solve it, rather than thinking from the real needs of the subject itself. Reasonable and cost-effective analysis technology. If you can get the same result in a simple way, there is no need to quote a complex data analysis model.

Heavy! Collection of blacklists and early warning journals of all units in China!!

January 25, 2021/0 Comments/in Expert Research Service /by support service

Recently, the First Hospital of Jilin University has compiled warning journals:

Medicine
International journal of clinical and experimental medicine
PLoS one
Scientific reports
Oncology letters
Experimental and therapeutic medicine
Biochemical and biophysical research communications
British journal of biomedical science
Cancer radiotherapies
International journal of molecular medicine
International journal of osteopathic medicine
Journal of genetic counseling
Material science in semiconductor processing
Journal of cellular Biochemistry
Biomedicine and pharmacotherapy
Journal of cellular physiology
Life Sciences
European review for medical and pharmacological sciences
Cancer biomarkers
International journal of clinical and experimental pathology
Caner management and research
American journal of cancer research
American journal of translational research
Biomed research international
Bioscience reports
International journal of biochemistry & cell biology
International journal of oncology
Journal of cancer
Journal of cellular and molecular medicine
Journal of clinical medicine
Journal of experimental & clinical Cancer research
Journal of international medical research,
Molecular medicine reports
Oncology research
Oncotargets and therapy
Theranostics
World journal of Gastroenterology
Artificial cells nanomedicine and biotechnology
Experimental and molecular pathology
Biofactors
Brazilian journal of medical and biological research
International journal of immunopathology and pharmacology
Medical science monitor
Bio-medical research Tokyo

On December 31, 2020, The Chinese Academy of Sciences officially released the “International Journal Early Warning List (Trial)”

Metal
Coatings
Materials
Journal of nanoscience and nanotechnology
Minerals
Atmosphere
Artificial cells nanomedicine and biotechnology
Advances in civil engineering
International journal of energy research
Mathematical problems in engineering
Sensors
Energies
Applied sciences-base1
Polymers
Electronics
Processes
Complexity
Desalination and water treatment
International journal of electrochemical science
Catalysts
Molecules
Natural product research
Sustainability
Water
Ieee access
Agronomy-base1
Journal of cellular biochemistry
Journal of cellular physiology
Bioscience reports
Biomed research international
Plant-base1
Cells
Boundary value problems
Advances in difference equations
Mathematics
European review for medical and pharmacological sciences
International journal of clinical and experimental pathology
Medicine
International journal of clinical and experimental medicine
Biomedicine and pharmacotherapy
Experimental and molecular pathology
Brazilian journal of medical and biological research
International journal of immunopathology and pharmacology
Medical science monitor
American journal of translational research
Journal of biomaterials and tissue engineering
Aging-us
Life sciences
Journal of clinical medicine
International journal of environmental research and public health
Acta medica mediterranea

The First Affiliated Hospital of Sun Yat-sen University does not support the journal catalog

1. European review for medical and pharmacologicalsciences

2. Cancer management and research

3. Bioscience reports

4. Cancer biomarkers

5. Journal of International medicalresearch

6. Journal of cellular biochemistry

7. Biochemical and biophysical researchcommunications

8. Biomedicine and pharmacotherapy

9. American journal of cancer research

10. Journal of cellular physiology

11. Life sciences

12. Journal of cellular and molecularmedicine

13. Theranostics

14. Journal of Experimental and clinicalcancer research

15. Journal of cancer

16. International journal of molecularmedicine

17. American journal of translationalresearch

18. Biomed research international

19. Journal of clinical medicine

20. Oncotarget

21. Medicine

22. Scientific reports

23. Tumor biology

24. International journal of biochemistryand cell bilogy

25. Biomedical research-INDIA

26. Cellular physiology and biochemistry

27. International journal of Clinical and experimentalmedicine

28. International journal of clinical andexperimental pathology

29. Experimental and therapeutic medicine

30. Molecular medicine reports

31. Medical science monitor

32. Oncology letters

33. International journal of oncology

34. World journal of gastroenterology

35. oncology research

36. Oncotargets and therapy

37. Plos one

The 8 best open source tools for data mining

Common mistakes in data analysis

Heavy! Collection of blacklists and early warning journals of all units in China!!

Writing requirements for SCI Research paper

Download research papers working links

How to Write a Curriculum Vitae (CV) for a Job Application

About Us

Home

Address

Business Hours