This repository contains scripts for network analysis of topics discovered by Math Research Compass. The aim is to explore and visualize relationships in mathematical research, leveraging various data science techniques and tools.
For the latest updates and downloads, visit the Releases section.
To get started with this project, you will need to have Python installed on your machine. The scripts utilize libraries such as networkx
, bertopic
, and others to perform analysis on the arXiv dataset.
networkx
, pandas
, numpy
, bertopic
, matplotlib
, seaborn
, scikit-learn
You can install the required libraries using pip:
pip install networkx pandas numpy bertopic matplotlib seaborn scikit-learn
Clone the repository to your local machine:
git clone https://github.com/hevrk/MRC-Network-Analysis.git
cd MRC-Network-Analysis
For specific releases, check the Releases section. Download the necessary files and execute them as instructed.
After setting up the environment, you can run the scripts to perform various analyses. Each script is designed to address different aspects of network analysis.
To execute a script, use the following command in your terminal:
python script_name.py
Replace script_name.py
with the name of the script you want to run.
For instance, to run the topic modeling script, use:
python topic_modeling.py
This script will analyze the arXiv dataset and produce a network graph of topics.
This project delves into several topics relevant to mathematical research and network analysis:
Here is a brief overview of the main scripts available in this repository:
data_extraction.py
This script extracts data from the arXiv API and prepares it for analysis.
topic_modeling.py
Utilizes the BERTopic library to identify and visualize topics within the dataset.
network_visualization.py
Generates visual representations of the research network, showcasing relationships between papers and authors.
collaboration_analysis.py
Analyzes co-authorship patterns and generates metrics related to scientific collaboration.
scientometrics_analysis.py
Focuses on publication trends, citation counts, and other metrics relevant to scientific output.
network_metrics.py
Calculates various network metrics, such as degree centrality and clustering coefficients.
Contributions are welcome! If you have suggestions for improvements or new features, please fork the repository and submit a pull request.
This project is licensed under the MIT License. See the LICENSE file for details.
For any inquiries or suggestions, please reach out to the repository maintainer:
For updates and downloads, donβt forget to check the Releases section.