Add guide for R#38
Open
david-cortes-intel wants to merge 6 commits into
Open
Conversation
rsiyer-intel
reviewed
Jul 1, 2026
|
|
||
| If requests mostly involve compute-heavy operations (e.g. matrix multiplications, as opposed to fetching data from online databases), it is recommended to limit the number of parallel requests to number of threads or to number of physical cores in the machine, as otherwise requests will compete for resources and this will cause slowdowns and decreased throughput. Likewise, If using [Kubernetes](https://kubernetes.io) (also known as 'k8s'), avoid allocating less than a full CPU core to a compute-heavy pod, and avoid fractional core allocations. | ||
|
|
||
| ## Data frame operations |
Collaborator
There was a problem hiding this comment.
For this section, is it possible to provide a summary/compact recommendation table/decision table (example - data size, operation type, memory constraints) that will help answer "which one should I pick?"
rsiyer-intel
reviewed
Jul 1, 2026
| conda install -c conda-forge libblas=*=*mkl* liblapack=*=*mkl* | ||
| ``` | ||
|
|
||
| **It's highly recommended to install these from the conda-forge channel**, where uploads are performed directly by Intel and the most recent versions are always available, compared to the Anaconda channel. The miniforge distribution by default installs packages exclusively from the conda-forge channel, and as such might be a more desirable choice than others. |
Collaborator
There was a problem hiding this comment.
Will it help to provide links to conda-forge channel here?
rsiyer-intel
reviewed
Jul 1, 2026
|
|
||
| Those sparse objects will be accepted as input by many modeling-related packages, such as `glmnet`, `xgboost`, `ranger`, `rsparse` and others, which have routines to operate efficiently on them. | ||
|
|
||
| As a general rule, sparse representations only start being advantageous when the number of non-zeros in the data is less than 10%, but the exact threshold at which switching is optimal can vary a lot by use-case. If the amount of non-zeros is less than 1% however, it is very unlikely that a regular dense data representation would be more efficient when a sparse format is supported. |
Collaborator
There was a problem hiding this comment.
Please fix following typos -
modifyin → modifying — line 3
environmnet → environment — line 86
sytem (in “sytem level”) → system — line 118
onMKL → oneMKL — line 122
apriori → a priori — line 159
PlumbeR → plumber — line 268
constitude → constitute — line 270
rsiyer-intel
requested changes
Jul 1, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Adds a guide for optimizing R workflows.