languagechange.models.meaning package¶
Submodules¶
languagechange.models.meaning.clustering module¶
- class languagechange.models.meaning.clustering.APosterioriaffinityPropagation(*args, **kwargs)[source]¶
Bases:
ClusterMixin,BaseEstimatorA class that implements the APP clustering algorithm.
This class is compatible with the [scikit-learn](https://scikit-learn.org) ecosystem.
- Parameters:
damping (float, default=0.9) – Damping factor in the range [0.5, 1.0) is the extent to which the current value is maintained relative to incoming values (weighted 1 - damping). This in order to avoid numerical oscillations when updating these values (messages).
max_iter (int, default=200) – Maximum number of iterations.
convergence_iter (int, default=15) – Number of iterations with no change in the number of estimated clusters that stops the convergence.
copy (bool, default=True) – Make a copy of input data.
preference (array-like of shape (n_samples,) or float, default=None) – Preferences for each point - points with larger values of preferences are more likely to be chosen as exemplars. The number of exemplars, ie of clusters, is influenced by the input preferences value. If the preferences are not passed as arguments, they will be set to the median of the input similarities.
affinity ({'euclidean', 'cosine'}, default='cosine') – Which affinity to use. At the moment
cosine,euclideanare supported. ‘euclidean’ uses the negative squared euclidean distance between points.verbose (bool, default=False) – Whether to be verbose.
random_state (int, RandomState instance or None, default=42) – Pseudo-random number generator to control the starting state. Use an int for reproducible results across function calls.
th_gamma (int, default=0) – Threshold over the aging index gamma. Must be in [1, ∞). Clustering refinement is not enforced when th_gamma=0.
- cluster_centers_indices_¶
Indices of cluster centers.
- Type:
ndarray of shape (n_clusters,)
- cluster_centers_¶
Cluster centers.
- Type:
ndarray of shape (n_clusters, n_features)
- labels_¶
Labels of each point.
- Type:
ndarray of shape (n_samples,)
- affinity_matrix_¶
Stores the affinity matrix used in
fit.- Type:
ndarray of shape (n_samples, n_samples)
- feature_names_in_¶
Names of features seen during fit. Defined only when X has feature names that are all strings.
- Type:
ndarray of shape (n_features_in_,)
- fit(X, y=None)[source]¶
Fit the clustering from features.
- Parameters:
X ({array-like, sparse matrix} of shape (n_samples, n_features)) – Training instances to cluster. If a sparse feature matrix is provided, it will be converted into a sparse
csr_matrix.y (Ignored) – Not used, present here for API consistency by convention.
- Returns:
Returns the instance itself.
- Return type:
self
languagechange.models.meaning.meaning module¶
- class languagechange.models.meaning.meaning.WordSenseInduction[source]¶
Bases:
MeaningModel
- class languagechange.models.meaning.meaning.StaticEmbedding[source]¶
Bases:
ABCPlaceholder base for static embedding types.
- class languagechange.models.meaning.meaning.SGNS[source]¶
Bases:
StaticEmbedding