Skip to main content

DeduplicationConfig

Struct DeduplicationConfig 

Source
pub struct DeduplicationConfig {
    pub enabled: bool,
    pub similarity_thresholds: HashMap<Namespace, f32>,
    pub default_threshold: f32,
    pub recent_window: Duration,
    pub cache_capacity: usize,
    pub min_semantic_length: usize,
}
Expand description

Configuration for the deduplication service.

§Environment Variables

VariableTypeDefaultDescription
SUBCOG_DEDUP_ENABLEDbooltrueEnable deduplication
SUBCOG_DEDUP_THRESHOLD_DECISIONSf320.92Threshold for decisions namespace
SUBCOG_DEDUP_THRESHOLD_PATTERNSf320.90Threshold for patterns namespace
SUBCOG_DEDUP_THRESHOLD_LEARNINGSf320.88Threshold for learnings namespace
SUBCOG_DEDUP_THRESHOLD_DEFAULTf320.90Default threshold
SUBCOG_DEDUP_TIME_WINDOW_SECSu64300Recent capture window
SUBCOG_DEDUP_CACHE_CAPACITYusize1000LRU cache size
SUBCOG_DEDUP_MIN_SEMANTIC_LENGTHusize50Min content length for semantic check

§Example

use subcog::services::deduplication::DeduplicationConfig;
use subcog::models::Namespace;

let config = DeduplicationConfig::default();
assert!(config.enabled);
assert_eq!(config.default_threshold, 0.90);
assert_eq!(config.get_threshold(Namespace::Decisions), 0.92);

Fields§

§enabled: bool

Enable/disable entire deduplication.

§similarity_thresholds: HashMap<Namespace, f32>

Per-namespace similarity thresholds.

§default_threshold: f32

Default threshold when namespace not configured.

§recent_window: Duration

Recent capture time window.

§cache_capacity: usize

Recent capture cache capacity.

§min_semantic_length: usize

Minimum content length for semantic check.

Content shorter than this will skip semantic similarity checking and rely only on exact match and recent capture detection.

Implementations§

Source§

impl DeduplicationConfig

Source

pub fn from_env() -> Self

Creates a new configuration from environment variables.

Falls back to defaults for any unset variables.

§Example
use subcog::services::deduplication::DeduplicationConfig;

let config = DeduplicationConfig::from_env();
// Config is populated from environment with defaults
Source

pub fn get_threshold(&self, namespace: Namespace) -> f32

Gets the similarity threshold for a namespace.

Returns the namespace-specific threshold if configured, otherwise returns the default threshold.

§Arguments
  • namespace - The namespace to get the threshold for
§Example
use subcog::services::deduplication::DeduplicationConfig;
use subcog::models::Namespace;

let config = DeduplicationConfig::default();
assert_eq!(config.get_threshold(Namespace::Decisions), 0.92);
assert_eq!(config.get_threshold(Namespace::Patterns), 0.90);
Source

pub const fn with_enabled(self, enabled: bool) -> Self

Builder method to set enabled state.

Source

pub fn with_threshold(self, namespace: Namespace, threshold: f32) -> Self

Builder method to set a namespace threshold.

Source

pub const fn with_default_threshold(self, threshold: f32) -> Self

Builder method to set the default threshold.

Source

pub const fn with_recent_window(self, duration: Duration) -> Self

Builder method to set the recent window duration.

Source

pub const fn with_cache_capacity(self, capacity: usize) -> Self

Builder method to set the cache capacity.

Source

pub const fn with_min_semantic_length(self, length: usize) -> Self

Builder method to set the minimum semantic length.

Trait Implementations§

Source§

impl Clone for DeduplicationConfig

Source§

fn clone(&self) -> DeduplicationConfig

Returns a duplicate of the value. Read more
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Debug for DeduplicationConfig

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl Default for DeduplicationConfig

Source§

fn default() -> Self

Returns the “default value” for a type. Read more

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> DynClone for T
where T: Clone,

Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

§

impl<T> FromRef<T> for T
where T: Clone,

§

fn from_ref(input: &T) -> T

Converts to this type from a reference to the input type.
§

impl<T> FutureExt for T

§

fn with_context(self, otel_cx: Context) -> WithContext<Self>

Attaches the provided Context to this type, returning a WithContext wrapper. Read more
§

fn with_current_context(self) -> WithContext<Self>

Attaches the current Context to this type, returning a WithContext wrapper. Read more
§

impl<T> Instrument for T

§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided [Span], returning an Instrumented wrapper. Read more
§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
§

impl<T> IntoRequest<T> for T

§

fn into_request(self) -> Request<T>

Wrap the input message T in a tonic::Request
§

impl<L> LayerExt<L> for L

§

fn named_layer<S>(&self, service: S) -> Layered<<L as Layer<S>>::Service, S>
where L: Layer<S>,

Applies the layer to a service and wraps it in [Layered].
§

impl<T> Pointable for T

§

const ALIGN: usize

The alignment of pointer.
§

type Init = T

The type for initializers.
§

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more
§

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more
§

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more
§

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more
§

impl<T> PolicyExt for T
where T: ?Sized,

§

fn and<P, B, E>(self, other: P) -> And<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns [Action::Follow] only if self and other return Action::Follow. Read more
§

fn or<P, B, E>(self, other: P) -> Or<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns [Action::Follow] if either self or other returns Action::Follow. Read more
Source§

impl<T> Same for T

Source§

type Output = T

Should always be Self
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

§

fn vzip(self) -> V

§

impl<T> WithSubscriber for T

§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a [WithDispatch] wrapper. Read more
§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a [WithDispatch] wrapper. Read more
§

impl<T> Allocation for T
where T: RefUnwindSafe + Send + Sync,