Can we do topic modeling?

Use case:
based on a software, which other software it is more related to?

How is this done?
1- Calculate topics for corpus based on description (e.g., based on Latent Dirichlet Allocation distance)
2- For each topic, you have the probability of a document to belong to that topic, creating clusters of software.
3- Having a new query (in this case a series of keywords), you would calculate which cluster they are more similar to.

We can also define a metric based on graph similarity (to explore)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can we do topic modeling? #5

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Can we do topic modeling? #5

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions