hello everyones so im trying to set up a new hpc cluster i made an account and added users and im using a partition but whenerver i run a job it gives me an error that request node configuration is not available i checked my slurm.conf but it seems good to me i need some help the error Batch job
Tag: cluster-computing
install python packages using init scripts in a databricks cluster
I have installed the databricks cli tool by running the following command pip install databricks-cli using the appropriate version of pip for your Python installation. If you are using Python 3, run pip3. Then by creating a PAT (personal-access token in Databricks) I run the following .sh bash script: python_…
job can’t be submitted inside sge file
I want to submit a sge job via sge file. For example, I have run.sge file as follows: And run_inp.sge file as follows: Whenever I submit job via I got this error: But if I submit run_inp.sge directly, it works fine: My question is that can I submit sge jobs inside a sge job? If not, is there alternative way
How to restart if NodeJS API service failed?
I’ve the similar NodeJS code: cluster.js As you can see in the above code I’m using NodeJS Manual cluster method instead of PM2 cluster, because I need to monitor my API via Prometheus. I’m usually starting the cluster.js via pm2 start cluster.js, however due to some DB connection our app.js…
How does an odd number solve a split brain in a distributed system?
Distributed system suggest using odd number of Master nodes, like 3 Master nodes or 5 Master nodes to avoid split brain problem. But how does it solve the problem? If there’s 2 nodes ( A and B ), 1 Moderator, if A and B tell the Moderator that “I’m Master”, then brain split occurs. The…
How to feed a large number of samples in parallel to linux?
I’m trying run following command on a large number of samples. I have: but I have thousands of these samples to process. Each sample takes about a day or two to finish on my local computer. I’m using a shared linux cluster and a job scheduling system called Slurm, if that helps. Answer Write a sub…